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MULTIVALENT AND MOLTIS PECIFIC ANTIGEN- BIRD TNG PROTEIN 
i?TPT.n 9? TUB INVENTION 

5 The present invention relates to multivalent and 

multispecif ic antigen binding proteins, methods for their 
1 production and uses thereof. In particular, the 
invention relates to binding proteins comprising 
polypeptides which associate to form multivalent or 
10 multispecif ic multimers. 

BJVCKqRQTOD 9* THE IflVEqmQK 

Antibodies are pr otein mol ecules having a structure based 

15 on a unit compris ing four polypeptides , two identical 
heavy chains and t wo identical light chains , which are 
covalently linked together by disulphide bonds. Each of 
these chains is folded in discrete domains. The C- 
terminal regions of both heavy and light chains are 

20 conserved in sequence and are called the constant 

regions, comprising one or more so-called C-domains. The 
N- terminal regions of the heavy and light chains, also 
known as V- domains, are variable in sequence and 
determine the specificity of the antibody. The regions 

25 in the variable d omains of the light and heavy ch ains (V L , 
and V B r espectively) responsible for antige n bin ding 
activity are known as the hvpervariable or 
compleme ntarity determining regions (CPR ) . Natural 
antibo dies have at least two identical antigen-binding 

30 sit es defined by the ass ociati on of the heavy and light 
ch ain variable regions. 

It is known that proteolytic digestion of an antibody can 
lead to the production of antibody fragments. Such 
35 fragments, or portions, of the whole antibody can exhibit 

antigen binding activity. An example of a binding 
fragment is an fragment which comprises a light chain 
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associated with the V H and domains of a heavy chain. 
The bivalent F(ab 1 ) 3 fragment comprises two such 
fragments connected together via the hinge region, giving 
two antigen binding sites. F v fragments, consisting only 
5 of the V-domains of the heavy and light chains associated 
with each other may also be obtained. These F v fragments 
are monovalent for antigen binding. Smaller fragments 
such as individual V-domains (domain antibodies or dABs, 
Ward et al Nature, 341 , 544 (1989) and individual CDR's 
10 (Williams et al, Proc. Natl. Acad. Sci, USA, ££, 5537 

(1989)) have also been shown to retain the binding 
characteristics of the parent antibody although generally 
most naturally occurring antibodies need both a V H and V L 
to retain full immunoreactivity . 

15 

Antibody fragments comprising V K and V L domains associated 
together to have antigen binding activity have also been 
described. The single chain F v fragment (scFv) comprises 
a V H domain linked to a V L domain by a flexible 
20 polypeptide linker such that the domains can associate to 
form an antigen binding site (see, for example, BP-B- 
0281604, Bnzon Labs Inc) . 

Microbial expression systems for producing active 
25 antibody fragments are known in the literature. The 
production of Fab in various hosts such as E.coli. 
(Better et al. Science, 240 . 104, (1988)), yeast (Horwitz 
et al, Proc. Natl. Acad. Sci, US4, Q5, 8678 (1988)) and 
the filamentous fungus Trichoderma reesei (Nyyssdnen et 
30 al, Bio/Technology, H, 591 (1993)) have previously been 
described, for example. It is also known that plants can 
be used as hosts for the production of SCFv fragments 
(Owen et al, Bio/Technology, Ifi, 790 (1992)) as well as 
whole antibodies* 

35 

^advjauit^ge^Qj Lusing antibody fragments rathpr fhan ., 
whole antibodies in diagnosis and therapy lies in their 
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smaller size. They are likely to be less immunogenic 
than whole antibodies and more able to penetrate tissue. 
A disadvantage associated with the use of fragments such 
as the F^, F v , and S C F V antibody fragments described above, 
5 however is that they have only one binding site for 
antigen binding as compared to the two or more sites 
contained in the whole antibody, preventing polyvalent 
binding to the antigen and hence leading to reduced 
avidity. 

10 

In an attempt to overcome this problem, attention has 
been directed to providing multivalent antigen binding 
proteins, that is binding proteins having more than one 
antigen binding site. In addition, there has been 

15 interest in producing antigen-binding proteins having 
multiple specificities capable of binding to different 
antigenic determinants and containing antigen binding 
domains derived from different sources. Antigen-binding 
proteins having distinct binding specificities may be 

20 useful, for example, in targeting effector cells to 

target cells by virtue of the specific binding of the 
different binding domains. By way of illustration, a 
bispecific antigen binding protein having specificity for 
both tumour cells and cytotoxic drugs may be used to 

25 target specifically cytotoxic drug to tumour cell in an 
efficient manner. By avoiding the need for chemical 
modification, adverse immune responses may be avoided. 

Hitherto, the potential application of multivalent and 
30 multispecific antigen binding proteins have been hindered 
by the difficulties in generating and purifying such 
molecules . 

Recombinant antigen-binding proteins having two binding 
35 sites may be prepared by methods such as chemical cross - 
linking of cysteine residues, either through cysteine 
residues introduced at the C- terminus of the V K of an F v 



WO 97/38102 



PCT/EP97/01609 



(Cumber et al, J. Immunol., 149 . 120 (1992)), through the 
hinge cysteine residues in to generate (Fab 1 ) 2 (Carter 
et al, Bio/Tech., i£), 163 (1992)) or at the C-terminus of 
the V L of an scPv (Pack and Plilckthun, Biochemistry, H, 
5 1579 (1992)), Alternatively, the production of bivalent 
and bispecific antibody fragments based on the inclusion 
of fragments of C- terminal peptide sequences which 
promote dimerisation has been described. (Kostelny et 
al, J. Immunol., 148 . 1547). 

10 

Bivalent or bispecific antibody fragments comprising a 
binding complex containing two polypeptide chains, one 
comprising two heavy chain variable domains (V H ) in series 
and the other comprising two light chain variable domains 
15 (V L ) in series are described in our pending European 

Patent Application No. 95307332.7. 

Multivalent and/or multispecif ic antibody fragments are 
described in WO 94/09131 (Scotgen Limited) . Specific 

20 binding proteins having two binding regions, contained at 
least in part on first and second polypeptide chains 
which chains additionally incorporate associating domains 
capable of binding to each other causing the polypeptide 
chains to combine are disclosed therein. It is disclosed 

25 that the first and second binding regions preferably are 
antibody antigen-binding domains, for example comprising 
V H and V L regions contained in a Fab fragment or in a 
single -chain Fv fragment, or may be derived from just one 
of the V H or V L regions of an antibody. The associating 

30 domains may suitably be derived from an antibody and may 
be inter alia antibody V H and V L regions. It is further 
disclosed that using a V H /V L domain combination to achieve 
association leads to the creation of a supplementary Fv 
domain such that the antibody produced may be trivalent. 

35 Schematic representations of the arrangements suggested 
in WO 94/09131 to produce trivalent fragments are shown 
in Figure 1A. 
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WO 93/11161 (Enzon Inc) describes multivalent antigen- 
binding proteins comprising two or more single -chain 
protein molecules, each single chain molecule comprising 
first and second polypeptides each comprising the binding 
5 portion of the variable region of an antibody heavy or 
light chain with the polypeptides being linked together 
via a peptide linker. Hypothetical trimers and tetramers 
are discussed, comprising three or four single-chain 
antigen binding proteins as appropriate. Schematic 
10 representations of the trivalent arrangements suggested 
are shown in Figure IB. 

WO 91/19739 (Celltech Limited) discloses multivalent 
antigen binding proteins comprising an Pv fragment bound 
to at least one further Pv fragment by a connecting 
structure which links the Pv fragments together but which 
maintains them spaced apart such that they can bind to 
adjacent antigenic determinants. Conveniently the 
connecting structure consists of a spacing polypeptide 
and a linkage unit such as a cross -linking maleiraide 
linker or a molecule which allows for non-covalent 
binding. Particularly preferred connecting structures 
which are disclosed are based on antibody joining and 
hinge region sequences. 

SUMMARY OF THE INVENTION 

According to the present invention there is provided a 
multivalent antigen binding protein comprising: 

a first polypeptide comprising in series, three or 
more variable domains of an antibody heavy chain; 
and 



15 



20 



25 



35 



a second polypeptide comprising, in series, three or 
more variable domains of an antibody light chain, 
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said first and second polypeptides being linked by 
association of the respective heavy chain and light chain 
variable domains, each associated variable domain pair 
forming an antigen binding site . 

5 

As used herein, the term multivalent means more than one 
antigen binding site. 

Preferably the first polypeptide comprises three variable 
10 domains of an antibody heavy chain and the second 
polypeptide comprises three variable domains of an 
antibody light chain, providing a trivalent protein. 

It will be appreciated that the polypeptides may comprise 
15 heavy or light chains, variable domains, as appropriate, 
or functional equivalents thereof. 

The respective heavy or light chain variable domains may 
suitably be linked without any intervening linker. 

20 According to a preferred embodiment, however, the 

variable domains contained in the individual polypeptides 
are linked by peptide linkers. Preferably the peptide 
linker is flexible, allowing the variable domains to flex 
in relation to each other such that they can bind to 

25 multiple antigenic determinants simultaneously . It will 
be appreciated that the binding of the linker to the 
individual heavy or light chain variable domains will be 
such that it does not affect the binding capacity of the 
binding site formed by the associated variable domain 

30 pair. Conveniently the peptide linker comprises from 16 
to 19 amino acid residues. A preferred, peptide linker 
for heavy chain domains is (Gly 4 Ser) 3 AlaGlySerAla and for 
the light chain domains is <Gly 4 Ser) 3 Val . 

35 It will be appreciated that if two or more of the 

associated variable domain pairs (V H /V L pairs) have the 
same antigen specificity, for example if they are derived 
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from the same parent antibody or fragment thereof or from 
different antibodies which bind the same epitope, then a 
binding protein which binds more than one molecule of the 
same type will be produced. 

According to one embodiment, where the binding protein 
according to the invention comprises three antigen 
binding sites which are able to bind different epitopes 
from each other, a trivalent trispecific protein is 
produced . 

In another embodiment, where the binding protein 
according to the invention comprises three associated 
variable domain pair binding sites, two of which sites 
bind the same epitopes, a trivalent, bispecific protein 
is provided. Where all three binding sites have the same 
antigen specificity, a trivalent, monospecific binding 
protein is provided. 

The invention also provides nucleotide sequences coding 
for the polypeptides of the multivalent antigen binding 
protein according to the invention and cloning and 
expression vectors containing such nucleotide sequences. 

The invention further provides host cells transformed 
with vectors containing such nucleotide sequences and 
methods of producing such polypeptides by expression of 
the nucleotide sequences in such hosts. 

The invention further provides a process for preparing a 
multivalent antigen binding protein as set forth above 
comprising : 

(i) transforming one or more hosts by incorporating 
genes encoding said first and second 
polypeptides; 

(ii) expressing said genes in said host or hosts; 
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(iii) allowing said first and second polypeptides to 
combine to form the antigen binding protein. 



Suitably the host or hosts may be selected from 
5 prokaryotic bacteria, such as Gram-negative bacteria, for 
exanqple E.Coli, and Gram-positive bacteria, for example 
B. subtilis or lactic acid bacteria, lower eukaryotes 
such as yeasts, for example belonging to the genera 
Saccharomyces Kluyveromyces or Trichoderma, moulds such 

10 as those belonging to the genera Aspergillus and 

Neurospora and higher eukaroytes, such as plants, for 
example tobacco, and animal cells, examples of which are 
myeloma cells and CHO, COS cells and insect cells. A 
particularly preferred host for use in connection with 

15 the present invention is COS (monkey kidney) cells. 

Techniques for synthesising genes, incorporating them 
into hosts and expressing genes in hosts are well known 
in the art and the skilled person would readily be able 
20 to put the invention into effect using common general 
knowledge. Proteins according to the invention may be 
recovered and purified using conventional techniques such 
as affinity chromatography, ion exchange chromatography 
or gel filtration chromatography. 

25 

The activity of the multivalent binding proteins 
according to the invention may conveniently be measured 
by standard techniques known in the art such as enzyme- 
linked immunosorbant assay (BLISA) , radioimmune assay 
30 (RIA) or by using biosensors. 

The multivalent antigen binding proteins of the present 
invention may suitably be used in diagnostics or therapy 
for example in targeting a tumour cell with natural 
35 killer cells and cytotoxic agent. Other uses for which 
the multivalent binding proteins according to the 
invention are useful include those uses for which 
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antibodies or fragments thereof are commonly used, 
including for immunoassays and in purification. 
According to a particular preferred embodiment, multi- 
enzyme complexes may be assembled, at a target, for 
example a cell surface. As an illustration, multivalent 
binding proteins according to the invention may be used 
to target cell killing enzymes such as an oxidase (for 
example glucose oxidase) and peroxidase (for example 
horseradish peroxidase) to a target species which is am 
antigenic component of dental plaque, such as S. sanguis 
or S. mutans. Complexes comprising enzyme, coenzyme and 
target antigen may also conveniently be assembled. 

Accordingly, the invention also provides compositions 
comprising the multivalent antigen binding proteins 
according to the invention, conveniently in combination 
with a cosmetically or pharmaceutically acceptable 
carrier, diluent or excipient. Methods of treatment 
using the multivalent antigen binding proteins according 
to the invention are also provided. 

For use in diagnosis or therapy, the multivalent antigen 
binding proteins according to the invention may 
conveniently be attached to an appropriate diagnostically 
or therapeutically effective agent or carrier by methods 
conventional in the art. 

An advantage of using multivalents antigen binding 
proteins according to the invention over multivalent 
binding proteins prepared by existing techniques known in 
the art is that the "self -assembling" association of the 
respective heavy and light chain variable domains to form 
the multivalent binding sites avoids the need for 
chemical coupling steps or the introduction of linking 
residues to stabilise the multivalent constructs, thereby 
minimising the risk of eliciting an immune response to 
such molecules when the resulting multivalent binding 
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proteins are used in therapy. 



A particular advantage of molecules according to the 
present invention is that they may conveniently be 
5 purified straight from the supernatant using conventional 
purification techniques. As they are self -assembling, 
there is no need to purify individual subunits prior to 
coupling as in existing techniques. 

10 The present invention may be more fully understood with 
reference to the following description, when read 
together with the accompanying drawings. 



15 



20 



BRIEF DESCRIPTION O F THE DRAWINGS 

Figures 1A and IB show schematic representations of 
published arrangements of heavy and light chain V-domain 
gene fragments that have been suggested to produce 
trispecific or trivalent antibody fragments: 



A) scFvl-VLa + scFv2-VHa (2 chains) WO 94/09131 

B) Fabl-Vla + Fab2 - VHa (4 chains) WO 94/09131 

C) scFvl-VLa-CLa + scFvl-VHa-CHa (2 chains) WO 94/09131 

D) Fabl-VLa-CLa + Fab2-VHa-CHa (4 chains) WO 94/09131 
25 B) scFvl + scFv2 + scFv3 (3 chains) WO 93/11161 

F) VH1-VL2 + VH2-VL3 + VH3-VH1 (3 chains) WO 93/11161 



Figure 2A/B shows the nucleotide sequence of the EcoRI- 
Hindlll insert of pG0SA.E2t containing DNA encoding pelB 
30 leader- VH4715 -linker- VL3418 and DNA encoding pelB leader- 
VL3418-linker-VH47l5-hydrophil2 tag (SEQ ID No. 1) . 



Figure 3 

35 A) shows the nucleotide sequence of the Hindlll-EcoRI 

insert of plasmid scFv.Lys with DNA encoding pelB leader- 
VHLys-1 inker- VLLys (SEQ ID No. 2). 
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B) shows the nucleotide sequence of the Hindlll-EcoRI 
insert of plasmid scFv.4715.2t with DNA encoding pelB 
leader- VH4715.2t (SEQ ID No. 3). 

5 Figure 4 shows the nucleotide sequence of the genomic 
leader sequence of the anti-NP antibody (Jones et al, 
Nature, 321 . 522) . Exon sequences are indicated with 
shaded boxes. Ncol and PstI restriction sites are in 
bold and underlined (SEQ ID No. 4) . 

10 

Figure 5 gives a schematic representation of the 
eukaryotic expression vector pSV.51. 

Figure 6 gives an overview of the pUC19 double head (A) 
15 and triple head (B) constructs. The position of the 
oligonucleotides and the restriction sites used for 
assembling double and triplehead pUC constructs are 
indicated. 

Figure 7 

A) shows the origin of the VH-C- linker and VL-C-linker 
fragments. 

B) gives a schematic representation of the construction 
of the pUC.19-triple-head vectors. 

Figure 8 

A) gives a schematic representation of the construction 
of the Euka.VH and Buka. VL vectors. 

B) gives a schematic representation of the construction 
of the pSV.VH expression vectors. 

C) gives a schematic representation of the construction 
of the pSV.VL expression vectors. 
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Figure 9 shows the expression of the trispecif ic Golysan 
proteins on an SDS-PAGE gel containing total COS culture 
supernatant. Crude supernatants of COS cells transfected 
with pSV expression vectors were separated on SDS-PAGE 
5 gels. The proteins were transferred onto a 

nitrocellulose membrane and the VH3 and VL3-2t were 
detected using anti-VH and anti-hydrophil 2 tag specific 
monoclonal antibodies respectively. (A=anti-Hydro-II, 
B=anti-Hydro-II + anti-VH) Samples: M) Low Molecular 
10 Weight Markers, 1) pSV.K + pSV.V,2) pSV.K + pSV.W,3) 
pSV.M + pSV.V,4) pSV.M + pSV.W. 

Figure 10 shows the results of three ELISA's. Lysozyme, 
Glucose oxidase and S. sanguis binding activity was 
15 determined in crude COS supernatants by measuring 1) 

Lysozyme -Glucose oxidase (=LYSOX) / 2) Glucose oxidase- 
S. sanguis (=GOSA) and 3) Lysozyme -S. sanguis (= LYSAN) 
bispecific binding activities. 

20 Figure 11 shows the results of three ELISA's. Lysozyme, 
Glucose oxidase and S. sanguis binding activity of 
purified Golysan. A (A) and Golysan. B (B) was determined 
by measuring 1) Lysozyme -Glucose oxidase (=LYSOX) , 2) 
Glucose oxidase- S. sanguis (=G0SA) and 3) Lysozyme- 

25 S. sanguis (= LYSAN) bispecific binding activities. 

Figure 12 shows the nucleotide sequence of the EcoRI- 
Hindlll insert of pUR.4124 containing DMA (see SBQ ID NO: 
23) encoding V L Lys- Linker- V H Lys. 

30 

Figure 13 shows the nucleotide sequence of the Hindlll- 
EcoRI insert of plasmid Fv.3418 (see SEQ ID NO: 24) 
containing DNA encoding pelB leader- V H 3418 and pelB 
leader-V L 3418 . 

35 

Figure 14 shows the nucleotide sequence of the Hindi II- 
EcoRI insert of plasmid Fv.4715-myc (see SEQ ID NO: 25) 
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containing DNA encoding pelB Ieader-V H 4715 and pelB 
Ieader-V L 4715-Myc tag. 

Figure 15 shows the nucleotide sequence of the Hindlll- 
5 EcoRI insert of scFv.4715-myc containing DNA (see SEQ ID 
NO: 26) encoding pelB leader-V H 4715- Linker- V L 47 15 -Myc tag. 

Figure 16a/b shows the nucleotide sequence of the 
Hindlll-EcoRI insert of pGOSA.E (see SEQ ID NO: 27) 
10 containing DNA encoding pelB leader-V H 4715 -Linker- V L 3418 
and pelB leader-V L 3418-Linker-V„4715 . 

Figure 16c gives an overview of the oligonucleotides and 
their positions in pGOSA.E that can be used to replace V- 
15 domain gene fragments. 

Figure 17 shows the .construction of plasmid pGOSA.A. 

Figure 18 shows the construction of plasmid pGOSA.B. 

20 

Figure 19 shows the construction of plasmid pGOSA.C. 
Figure 20 shows the construction of plasmid pGOSA.D. 
25 Figure 21 shows the construction of plasmid pGOSA.E. 

Figure 22 shows the source of fragment PCR.I BstEII/SacI. 
Figure 23 shows the source of fragment PCR.IV Xhol/EcdRI. 

30 

Figure 24 shows the source of fragment PCR.V Sall/EcoRI. 
Figure 25 shows the source of fragment PGR. Ill Nhel/Sacl. 
35 Figure 26 shows the source of fragment PCR.II Sfil/EcoRI. 
Table 1 shows the nucleotide sequence of all 
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oligonucleotides used in the construction of the 
described double and triple head constructs. 

Table 2 lists all pSV expression constructs described 
5 in this specification. 

The following examples are provided by way of 
illustration only: 

10 BSAMgLBS 

General BxperjLwQBtfli 

Strains, Plasmids and Media 

15 All cloning steps were performed in E.Coli JM109 or 
E.Coli XL-1 Blue. Cultures were grown in 
2xTY/Arap/Glucose medium (16g tryptone, lOg yeast extract, 
5g NaCL per liter H a O supplemented with 2% glucose and 
100^g/ml ampicillin) . Transformations were plated out on 

20 SOBAG plates (20g tryptone, 5g yeast extract, 15g agar, 
0.5g NaCl per liter H 3 0 plus lOmM MgCl 2 , 2% glucose, 
100/ig/ml ampicillin) . The bicistronic E.coli vectors 
used are derivatives of pUC19. The COS expression vector 
pSV*51 (LMBP strain nr 1829) was obtained from the LMBP 

25 Culture collection (Laboratory of Molecular Biology 
University Gent). COS-1 cells (ECACC No: 88031701; 
African green monkey kidney cells) were obtained from the 
European Collection of Animal Cell Cultures (ECACC) . All 
tissue culture reagents were from Gibco BRL (Life 

30 Technologies, Paisley, UK) 

DNA Manipulations 
Oligonucleotides and PGR 

The oligonucleotide primers used in the PCR reactions 
35 were synthesized on an Applied Biosystems 381A DNA 

Synthesiser by the phosphoramidite method. The primary 
structures of the oligonucleotide primers used in the 
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construction of the trispecific pSV constructs (Table 2) 
are shown in Table 1. Reaction mixture used for 
amplification of DNA fragments were lOmM Tris-HCl, pH8.3, 
2.5tnM MgCl a , 50mM KC1, 0*01% gelatin (w/v) , 0.1% Triton X- 
5 100, 400mM of each dNTP, 5.0 units of Vent DNA polymerase 

(New England Biolabs) , lOOng of template DNA, and 500ng 
of each primer (for 100/il reactions) . Reaction 
conditions were: 94°C for 4 minutes, followed by 33 
cycles of each 1 minute at 94 °C, 1 minute at 55 °C, and 1 
10 minute 72 °C. 

Plasmld DNA\Vector\Inaert preparation and 
ligation\ transformation. 

Plasmid DNA was prepared using the 'Qiagen P-100 and P- 

15 500 Midi/Maxi-DNA Preparation 1 system. Vectors and 

inserts were prepared by digestion of lOjig (for vector 
preparation) or 20/xg (for insert preparation) with the 
specified restriction endonucleases under appropriate 
conditions (buffers and temperatures as specified by 

20 suppliers) . Klenow fill-in reactions and 

dephosphorylation with Calf Intestine Phosphorylase were 
performed according to the manufacturers instructions. 
Vector DNA's and inserts were separated through agarose 
gel electrophoresis and purified with DEAE-membranes NA45 

25 (Schleicher & Schnell) as described by Maniatis et al. 

(Molecular cloning: a Laboratory manual, Cold Spring 
Harbour, N. Y. (1982)) Ligations were performed in 20jil 
volumes containing 30ntM Tris-HCl pH7.8, 10nM MgCl 2 , lOmM 
DTT, lmM ATP, 300-400ng vector DNA, 100-200ng insert DNA 

30 and 1 Weiss unit T 4 DNA ligase. After ligation for 2-4 h 
at room temperature, CaCl 2 competent E. coli JM109 or XL-1 
Blue (Maniatis et al) were transformed using 7.5^1 
ligation reaction. The transformation mixtures were 
plated onto SORAG plates and grown overnight at 37 °C. 

35 Correct clones were identified by restriction analysis 
and verified by automated dideoxy sequencing (Applied 
Biosys terns) . 
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Restriction digestion of PCR products 

Following amplification each reaction was checked for the 
presence of a band of the appropriate size by agarose gel 
electrophoresis. One or two 100^1 PCR reaction mixtures 
5 of each of the PCR reactions, together containing 

approximately 2-4/ig DNA product were subjected to phenol - 
chloroform extraction, chloroform extraction and ethanol 
precipitation. The DNA pellets were washed twice with 
70% ethanol and allowed to dry. Next, the PCR products 
10 were digested overnight (18 h) in 200/iL IxBuffer with 
excess of the appropriate restriction enzyme. 

Trans formation of COS Cells 

Cos-1 cells were maintained in DMEM culture medium with 

15 glutamine (2mM) , Penicillin (lOOU/mL) # streptomycin 
(100/ig/tnL) containing 10% F.C.S. For transient 
transfection assays JL-3 xlO 5 COS-1 cells were seeded in 3 
cm-diameter tissue culture dishes (2mL) . The cells were 
incubated at 37 °C in a C0 2 incubator until cells were 50- 

20 80% confluent (overnight) . For each transfection the 
following mixes were prepared: A) l/ig of each of the 
specified DNA 1 s in 100/iL Opti-MBM-I Reduced Serum Medium, 
B) 1/iL Lipof ectAmine in 100/iL Opti-MEM-I Reduced Serum 
Medium. Mixes A and B were combined (gently) . After 

25 allowing the DNA-liposome complexes to form for 30-45 
minutes at room temperature, 0.8mL Opti-MEM-I Reduced 
Serum Medium was added to each lipid DNA complex 
containing tube. The COS-1 cells were washed once with 
2mL of Opti-MEM-I Reduced Serum Medium and overlayed with 

30 the diluted complex solution. The COS-1 cells were 

incubated for 5 hr at 37°C. Following incubation, 2mL 
growth medium was added. 20 hours following transfection 
the medium was replaced with 2mL fresh growth medium 
containing O.lmM Na-butyrate. After 48 hours incubation 

35 at 37°C the supernatant was harvested and assayed for the 
presence of antibody fragments. 
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ELISA 

A) QOSA: Glucose Oxidase and S. sanguis binding activity 

96 well ELISA plates (Greiner HC plates) were activated 
overnight at 37 °C with 200/il/well of a 1/10 dilution of 
an overnight culture of Streptococcus sanguis cells in 
0.05M sodium carbonate buffer pH9.5 was used to sensitise 
each well. Following one wash with PBST, the antigen 
sensitised plates were pre-blocked for 1 hour at 37°C 
with 200/il/well blocking buffer (1% BSA, 0.15% Tween in 
PBS) . 50/il COS culture supematants (neat or diluted 
with PBS) plus 50/xl blocking buffer containing glucose 
oxidase (50/xg/ml) was added to the Streptococcus Sanguis 
sensitised plate and incubated for 2 hours at 37 °C. 
Following 4 washes with PBS-T, bound glucose oxidase was 
detected by adding lOOjzl substrate to each well (70mM Na- 
citrate, 320mM Na-phosphate, 27mg/ml glucose, 0.5/ig/ml 
HRP, lOOjig/ml 1MB) . The colour reaction was stopped 
after 1 hour by the addition of 35/il 2M HC1 and the A450 
was measured. 

B) LYSOX: Lysozyme and Glucose Oxidase binding activity 
96 well ELISA plates (Greiner HC plates) were activated 
overnight at 37°C with lysozyme (50/xg/mL in 0.05M sodium 
carbonate buffer pH9.5; 200^1/well) . Following one wash 
with PBST, the antigen sensitised plates were pre-blocked 
for 1 hour at 37°C with 200/il/well blocking buffer (1% 
BSA, 0.15% Tween in PBS) . 50j*l COS culture supematants 
(neat or diluted with PBS) plus 50/il blocking buffer 
containing glucose oxidase (50jig/ml) was added to the 
Streptococcus Sanguis sensitised plate and incubated for 
2 hours at 37°C. Following 4 washes with PBS-T, bound 
glucose oxidase was detected by adding lOOjil substrate to 
each well (70mM Na-citrate, 320mM Na-phosphate, 27mg/ml 
glucose, 0.5/ig/ml HRP, 100/ig/ml TMB) . The colour 
reaction was stopped after 1 hour by the addition of 35/xl 
2M HC1 and the A450 was measured. 
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C) LYSAN: S. sanguis and Lysozyme binding activity 
96 well BLISA plates (Greiner HC plates) were activated 
overnight at 37°C with 200/il/well of a 1/10 dilution of 
an overnight culture of Streptococcus sanguis cells in 
5 0.05M sodium carbonate buffer pH9.5 was used to sensitise 
each well. Following one wash with PBST, the antigen 
sensitised plates were pre-blocked for 1 hour at 37°C 
with 200/il/well blocking buffer (1% BSA # 0.15% Tween in 
PBS) . 50/il COS culture supernatants (neat or diluted 

10 with PBS) plus 50/il blocking buffer was added to the 

Streptococcus Sanguis sensitised plate and incubated for 
2 hours at 37 °C. Following 4 washes with PBS-T, 50/iL 
blocking buffer containing Alkaline- Phosphatase 
conjugated Lysozyme (100/i/mL) . Unbound Lysozyme was 

15 removed by 4 washes with PBS-T. Bound Lysozyme was 

detected by adding 100/iL substrate solution to each well 
(lmg/ml pNPP in 1M diethanolamine , ltnM MgCl 2 ) . After 1 
hour the A405 was measured. 

20 gv&MPT.g i s construc tion of the pSV.Golvsan expression 

vector 

The construction of the pSV COS expression vectors 
consisted of three stages: 

IA) : Assembly of 2 heavy chain variable domains and 2 
light chain varia ble domains in a pUC based E.Coli 
expressio n vector thus constructing th e VH A -VH„ and 
VL a -VLb modules respective ly. 

IB) : Assembly of 3 heavy chain variable domains and 3 
light chain variable domains i n a pUC based E.Coli 
expression vector thus constructing the VHa-VHb-VHc 
and VI^-VLb-VLc modules respectively. 

2) Linking the VH A -VH B( VH^VHb-VHc and VL^VLb, VL^y!^- 
VLc to the genomic anti-NP leader sequence in the 



25 



30 



35 
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intermediate "EUKA" vectors to ensure efficient 
secretion by COS cells. 

3) Inserting the leader-VX^-VH^ leader-VH A -VH B -VHc and 
leader -VLa-VLb , leader- Vh^VJ^-Vl^ as Xbal/Xbal 
fragments downstream of the SV40 promoter in the COS 
expression vector pSV.51. 

ad. 2 J E.coli expression vectors. 

The E.coli expression vectors are derivatives of pUC.19 
containing a Hindi II -EcoRI fragment that in the case of 
the scFv.lys-myc contains a pelB signal sequence fused to 
the 5» end of the heavy chain V-domain that is directly 
linked to the corresponding light chain V-domain of the 
antibody through a connecting sequence that codes for a 
flexible peptide (Gly 4 Ser) 3 thus generating a single-chain 
molecule. In the 'double head 1 expression vector both 
the heavy chain and the light chain V- domains of the 
antibody are preceded by a ribosome binding site and a 
pelB signal sequence in an artificial dicistronic operon 
under the control of a single inducible promoter. 
Expression of these constructs is driven by the inducible 
lacZ promoter. The nucleotide sequence of the Hindlll- 
BcoRI inserts of the scFv.lys-myc, scPv.47l5.2t and 
pGOSA.E2t constructs used for the generation of the 
trispecific antibody fragments are listed in Figures 3 
and 2 respectively. 

ad.lAj Assembly of bi- specific fragments or double heads. 

The construct pGOSA.E2t (Figures 2 and 6A) is derived 
from the E.coli expression construct pGOSA.E. The 
construction of pGOSA.E has been described in detail in 
preparation 1 below. 

In contrast with pGOSA.E, pGOSA.E2t contains a peptide 
tag at the C- terminus of the Variable light chain. Using 
oligonucleotides DBL3 and DBL.4 the VL4715 gene fragment 
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was amplified using scFv.47l5.2t as a template. The 
Sall/BamHI VH4715.2t PCR fragment and the Hydrophil-2 tag 
containing BamHI/EcoRI fragment from scPv.47l5.2t (Figure 
3B) were used to replace the Sall/EcoRI VH4715 fragment 
5 in pGOSA.E thus producing pGOSA.E2t. 

The vector pGOSA.E2t and the oligonucleotides in Table 1 
have been designed to enable most specificities to be 
cloned into the pGOSA.B2t construct (Figure 6A) . The 

10 upstream V H domain can be replaced by any Pstl-BstEII V H 
gene fragment obtained with oligonucleotides PCR. 51 and 
PCR. 89. The oligonucleotides DBL.l and DBL.2 were 
designed to introduce Sf il and Nhel restriction sites in 
the V H gene fragments thus allowing cloning of those V H 

15 gene fragments into the Sfil-Nhel sites as the downstream 
V H domain. Using this approach the following VH a -VHb 
combinations were constructed: VH4715-VH3418, VH4715- 
VHlys, VH3418-VHlys, VHlys-VH3418 . 

20 All V L gene fragments obtained with oligonucleotides 

PCR. 116 and PCR. 90 can be cloned into the position of the 
3418 V L gene fragment as a Sacl-Xhol fragment. A 
complication here however is the presence of an internal 
SacI site in the 3418 V H gene fragment. Oligonucleotides 

25 DBL.3 and DBL.4 are designed to allow cloning of V L gene 
fragments into the position of the 4715 V L gene fragment 
as a Sall-BamHI fragment. A complication here however is 
the presence of an internal BamHI site in the hydrophil- 
2-tag gene fragment (2t) . Using this approach the 

30 following VL A -VL„ combinations were constructed: VL3418- 
VL4715.2t, VLlys-VL4715.2t and VLlys-VL3418 . 2t . 

ad. IB J Assembly of tri- specific fragments or triple 
heads. 

35 Amplification of the VH-linker fragments using either 

scFv (VH- linker- VL) or bi-specific constructs (VH-linker- 
VH) as template with the primer combination DBL.1/DBL.5 
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(Figure 7A) yields one of the building blocks for the 
construction of the VH A -VH B -VHc modules. The VH-linker 
DBL . l/DBL . 5 PCR fragment is digested with Sfil and 
inserted into the Sfil site that is present between the 
5 linker sequence and the downstream VH domain in all bi- 
specific constructs (Figure 7B) thus producing a VH A -VH B - 
VHc module. Using this approach the following VH A -VH B -VH C 
combinations were constructed for this filing: VH4715- 
VHlys-VH3418 and VHlys-VH4715-VH3418 . 

10 

^Jgg ing a bi-specif ic construct , (VL-linker-VL) as the 
te mplate in an amplification reaction with the primer 
com bination DBL. 3 /DBL. 6, (Figure 7A) yields the VL- linker 
building block for the construction of the VL a -VLb-VL c 

15 modules. The VL-linker DBL. 3 /DBL. 6 PCR fragment is 

digested with Sail and inserted into the Sail site that 
is present between the linker sequence and the downstream 
VL domain in all bi- specific constructs (Figure 7B) thus 
producing a VLx-VLg-VLj. module. Using this approach the 

20 following VLa-VLb-VLc combinations were constructed: 
VLlys-VL4715-VL3418 . 2t and VL3418-VLlys-VL4715 . 2t . 

A schematic representation of the final tri- specific 
constructs is shown in Figure 6B. 

25 

ad. 2) Linking the variable region domains to the leader 
sequence. 

The Hindlll/BcoRI polylinker of pUC19 was replaced with a 
synthetic EcoRI/Hindlll *Buka* polylinker. This was 

30 achieved by annealing and inserting the synthetic 
oligonucleotides Euka.l and Euka.2 (Table 1) into 
EcoRI/Hindlll digested pUC19 vector. The resulting 
Buka.pUC vector contains all restriction sites needed for 
the subcloning of the leader sequence and the VH and VL 

35 domains. The NcoI/PstI genomic ant i -MP leader sequence 
fragment was cloned into the NcoI/PstI digested Buka.pUC 
vector yielding the Euka.VH construct (Figure 8A) . 
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Oligonucleotides ML.l and ML. 2 (Table 1) were used in an 
amplification reaction to introduce a SacI site at the 3* 
end of the leader sequence that allows the construction 
of leader-VL fusions. The NcoI/SacI leader sequence PCR 
5 fragment was inserted into NcoI/SacI digested Euka.pUC 
vector yielding the Euka.VL construct (Figure 8A) . 

The VH A -VH B and VH A -VH B -VHc modules were excised from the 
pUC expression vectors as Pstl/Nhel fragments and 

10 inserted into Pstl/Nhel digested Euka.VH vector (Figure 

8B) . Using this approach the following leader-VH A -VI^ and 
leader -VHa-VHb-VHc combinations were constructed for this 
filing: Euka.B: leader-VH4715-VH3418, Euka.D: leader- 
VH4715-VHlys, Euka.G: leader-VH3418-VHlys, Euka . K : 

15 leader-VH4715-VHlys-VH3418 and Euka.M: leader-VHlys- 

VH4715-VH3418. 

The VLa-VLb and VL^VLp-VLc modules were excised from the 
pUC expression vectors as EcoRI-Klenow/SacI fragments and 

20 inserted into Notl-Klenow/SacI treated Euka.VL vector 

(Figure 8C) . Using this approach the following leader- 
VL*- VLb and leader- VL^VLg-VLc combinations were 
constructed: Buka.N: leader-VL3418-VL4715 .2t f Euka.P: 
leader-VLlys-VL4715.2t Euka.S: leader- VLlys-VL34l8 .2t, 

25 Buka.V: leader-VLlys-VL4715-VL3418 . 2t and Euka.W: leader- 
VL3418-VLlys-VL4715 . 2t . 

ad. 3) Subcloning of leader-variable domain fusions into 
the pSV.Sl expression vector 

30 All leader- VH^VHb, leader-VH A -VHt-VHc # leader-VL^-VI^ and 
leader- VLa-VLb-VLc combinations were excised from the 
•Euka 1 vectors as Xbal/Xbal fragments and subcloned 
downstream of the SV40 promoter in pSV.51 (Figure 5) by 
insertion into the Xbal site (Figure 8B and 8C) . After 

35 confirmation of the correct orientation of the inserts 
the pSV expression vectors were used to transfect COS-l 
cells (see Example 2) . The pSV expression vectors used 
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are listed in Table 2. 

^f™!? 1 ^ 2: Bifunctional binding activity of fiflfl ypan 
triple heads 

This example describes the production of three types of 
bispecific binding activity by COS-1 cells transfected 
with expression plasmids encoding the corresponding VH X - 
VHp-VHc and VL a -VLb-VL c genes fragments. 

1. Production of antibody fragments by COS-1 calls 
Supernatants of COS-1 cells transfected with combinations 
of pSV-VH^VHs-VHc and pSV-VLx-VLe-VLc expression plasmids 
were separated on 10% SDS-PAGE and transferred onto 
nitrocellulose. The resulting Western blots were 
screened with a monoclonal antibody recognising a peptide 
sequence in framework 4 of the VH domains (region encoded 
by PCR.89: conserved in all used VH domains, {in-house 
reagent}) and/or a monoclonal specific for the hydrophil- 
2 tag. As shown in Figure 9 all supernatants contained 
products with the expected molecular weight of the VH A - 
VHb-VHc and VLa-VLb-VLc fragments, indicating that the COS 
cells were successfully tranfected and were secreting the 
produced antibody fragments into the culture medium at 
detectable levels. 

2. Bifunctional binding activity 

Supernatants of COS-1 cells transfected with single pSV 
expression plasmids and combinations of pSV expression 
plasmids were tested for the production of bifunctional 
binding activity using ELISA format: 

* Supernatants of COS-1 cells transfected with the 
bispecific positive controls 1 LYS AN 1 (pSV.D + pSV.P) , 
•LYSOX' (pSV.G + pSV.S) and 'GOSA' (pSV.B + pSV.N) only 
produced LYS AN, LYSOX and GOSA bispecific activity 
respectively (Figure 10) . No significant cross 
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reactivity was detected. 

* Supernatants of COS-1 cells transfected with only one 
expression vector encoding either one of the VH A -VH B -VHc 

5 fragments (pSV.K and pSV.M) or one of the VL^VLb-VLc 
fragments (pSV.V and pSV.W) did not exhibit any 
bispecific binding activity, indicating that no 
background binding or a specific binding activity is 
produced . 

10 

* All tested supernatants of COS-1 cells transfected with 
an expression vector encoding one of the VH A -VH B -VHc 
fragments (pSV.K and pSV.M) and an expression vector 
encoding one of the VL^-VI^-VLc fragments (pSV.V and 

15 pSV.W) showed significant levels of all three 

bifunctional binding activities LYSOX, GOSA and LYSAN. 

These results show that COS cells transfected with 
expression vectors encoding VH A -VH B -VHc and expression 

20 vectors encoding VLa-VLb-VLc fragments produce and secrete 
molecules that contain three binding activities. In this 
exampl e those three activities are: Glucose Oxidase 
binding, S. sanguis binding and Lysozyme binding . 
Furthermore, the results illustrated in Figure 10 clearly 

25 show that at least two of these binding activity are 
present in one self assembling molecular complex. In 
this example those combinations are: GOSA (Glucose 
Oxidase + S. sanguis), LYSOX (Lysozyme + Glucose Oxidase) 
and LYSAN (Lysozyme + S. sanguis) . 

30 

triple heads 

This example describes experiments that show that the 
35 three types of bispecific binding activity that are 
produced by COS-1 cells transfected with expression 
plasmids encoding the corresponding VH A -VH B -VH C and VL A - 
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VLb-VLc genes fragments are present in one self assembling 
molecular complex* 

Golysan.A (VHlys-VH4715-VH3418 + VLlys-VL4715-VL3418.2t) 
5 and Golysan.B (VHlys-VH4715-VH3418 + VL3418-VLlys- 
VL47l5.2t) was purified by affinity chromatography. 
100ml supernatant of COS-1 cells transfected with 
expression plasmids pSV.M/pSV.V (Golysan.A) or 
pSV.M/pSV.W (Golysan.B) were loaded onto a Lysozyme- 

10 Sepharose column (CNBr-Sepharose, Pharmacia; column was 
prepared according to the manufacturers instructions). 
After extensive washes with PBS the bound Golysan 
antibody fragments were eluted in 0.1M glycine buffer at 
pH=2.2. The fractions were neutralised with Tris and 

15 tested for the presence of trispecific binding activity. 

As shown in Figure 11 no bispecific binding activity was 
detect in the column fall- through. All three bispecific 
binding activities (GOSA, LYSOX and LYSAN) were extracted 

20 from the COS-1 supernatant by passing over the Lysozyme 

affinity matrix. After acid elution all three bispecific 
binding activities (GOSA, LYSOX and LYSAN) were recovered 
from the column. Since both Golysan.A and B were 
affinity purified based on the ability to bind to 

25 Lysozyme, the finding that these molecules also bind 
S. sanguis and Glucose Oxidase shows that all three 
binding activities are pres ent in one self assembling 
molecular complex. 

30 Preparation 1. 

Construction of the pGOSA.E double head expression vector 

In the pGOSA expression vectors, the DNA fragments 
encoding both the V H and of the antibody are preceded 
35 by a ribosome binding site and a DNA sequence encoding 
the pelB signal sequence in an artificial dicistronic 
operon under the control of a single inducible promoter. 
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Expression of these constructs is driven by the inducible 
lacZ promoter. The nucleotide sequence of the tfindlll- 
EcoRI inserts of the plasmids pUR.4124 (SEQ ID NO, 23), 
Fv.3418 (SEQ ID NO. 24), Fv.4715-rayc (SEQ ID NO. 25) and 
5 scFv.4715-myc (SEQ ID NO. 26) constructs used for the 

generation of the bispecific antibody fragments are given 
in Figures 12-15 , respectively. Moreover, a culture of 
E. coli cells harbouring plasmid scFv.4715-myc and a 
culture of E. coli cells harbouring plasmid Fv.3418 were 
10 deposited under the Budapest Treaty at the National 
Collection of Type Cultures (Central Public Health 
Laboratory) in London (United Kingdom) with deposition 
numbers NCTC 12916 and NCTC 12915, respectively. 

15 In agreement with Rule 28 (4) EPC, or a similar 

arrangement for a State not being a Contracting State of 
the EPC, it is hereby requested that a sample of such 
deposit, when requested, will be submitted to an expert 
only. 

20 

The construction of pGOSA.E (see Figure 16 for the 
Hindlll-BcoRI insert of pUC19) involved several cloning 
steps. The appropriate restriction sites in the various 
domains were introduced by PCR directed mutagenesis using 
25 the oligonucleotides listed in Table 1 below. 

The construction of pGOSA.E involved several cloning 
steps that produced 4 intermediate constructs pGOSA.A to 
pGOSA.D (see Figures 17-21) . The final expression vector 

30 pGOSA.E and the oligonucleotides in Table 1 have been 

designed to enable most specificities to be cloned into 
the final pGOSA.E construct (Figure 16c) . The upstream V H 
domain can be replaced by any Pstl-BstEIl V H gene fragment 
obtained with oligonucleotides PCR. 51 and PCR. 89 (see 

35 Table 1). The oligonucleotides DBL.l and DBL.2 (see 
Table 1) were designed to introduce Sfil and Nhel 
restriction sites in the V H gene fragments thus allowing 
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cloning of those V H gene fragments into the Sfil-Nhel 
sites as the downstream V H domain. All V L gene fragments 
obtained with oligonucleotides PCR.116 and PCR.90 (see 
Table 1) can be cloned into the position of the V^.3418 
5 gene fragment as a Sacl-Xhol fragment, A complication 

here however is the presence of an internal Sad site in 
the V H .3418 gene fragment. Oligonucleotides DBL.3 and 
DBL.9 (see Table 1) are designed to allow cloning of V L 
gene fragments into the position of the V t .47l5 gene 
10 fragment as a Sail- No tl fragment. 

pGOSA.A 

This plasmid is derived from both the Fv.4715-myc 
construct (SEQ ID NO. 25) and the scFv.4715-myc construct 

15 (SEQ IN NO. 26). An Sfil restriction site was introduced 

between the DNA sequence encoding the (Gly 4 Ser) 3 linker 
and the gene fragment encoding the of the scFv.47l5-myc 
construct (see Figure 17) . This was achieved by 
replacing the £s tEII -Sad fragment of the latter 

20 construct by the fragment PCR-I BstEII/SacI (Figure 22) 
that contains an Sfil site between the DNA encoding the 
(Gly 4 Ser) 3 linker and the V L .4715 gene fragment. The 
introduction of the Sfil site also introduced 4 
additional amino acids (AlaGlySerAla) between the 

25 (Gly 4 Ser) 3 linker and Vj,. 4715 resulting in a 

(Gly 4 Ser) jAlaGlySerAla linker (linkerA) . The 
oligonucleotides used to produce PCR-I (DEL. 5 and DBL . 7 , 
see Table l) were designed to match the sequence of the 
framework-3 region of V H .4715 and to prime at the junction 

30 of the DNA encoding the (Gly 4 Ser) 3 linker and the V t .4715 
gene fragment, respectively. Thus pGOSA.A can be 
indicated as: 

pel B - V H 47 15 - linkerA- (Sfil) -V L 4715-myc . 

35 pGOSA.B 

This plasmid is derived from plasmid Fv.3418 (see Figure 
18) . The JOjoI-EcoRI fragment of plasmid Fv.3418 
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coraprising the 3' end of DNA encoding framework-4 of the 
V L including the stop codon was removed and replaced by 
the fragment PCR-IV XhoI/EcoRI (Figure 23) . The 
oligonucleotides used to produce PGR- IV (DBL. 8 and DBL . 6 , 
5 see Table 1) were designed to match the sequence at the 
junction of the V L and the (Gly 4 Ser) 3 linker perfectly 
(DBL. 8) , and to be able to prime at the junction of the 
(Gly 4 Ser) 3 linker and the V H in pUR.4124 (DBL, 6) . DBL. 6 
removed the PstI site in the V H (silent mutation) and 
10 introduced a Sail restriction site at the junction of the 
(Gly 4 Ser) 3 linker and the V Hf thereby replacing the last 
Ser of the linker by a Val residue resulting in a 
(Gly 4 Ser) 2 Gly 4 Val linker (linkerV) . Thus pGOSA.B can be 
indicated as: 

15 pe2B-V a 3418 + pel B-V t 3 418 -linkerV- (Sail -BcoRI) . 

pGOSA.C 

This plasmid contains DNA encoding V H .4715 linked by the 
(Gly 4 Ser) jAlaGlySerAla linker to V M .3418 (see Figure 19), 
20 thus: 

pelB-V H 4715-linkerA-V H 3418 . 

This construct was obtained by replacing the Sfil-BcoRI 
fragment from pGOSA.A encoding V L .4715 by the fragment 

25 PCR-II Sfil/EcoRI containing the V H .3418 gene. The 

oligonucleotides used to produce PCR-II (DBL.l and DBL. 2, 
see Table 1) hybridize in the fratnework-1 and framework-4 
region of the gene encoding V H .3418, respectively. DBL.l 
was designed to remove the PstI restriction site (silent 

30 mutation) and to introduce an Sfil restriction site 
upstream of the V H gene. DBL. 2 destroys the BstEII 
restriction site in the framework-4 region and introduces 
an Nhel restriction site downstream of the stopcodon. 

35 pGOSA.D 

This plasmid contains a dicistronic operon coraprising the 
V„.3418 gene and DNA encoding V L .3418 linked by the 
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(Gly 4 Ser) 2 Gly 4 Val linker to V L .4715 (see Figure 20), thus: 
pe2B-V H 3418 + pelB-V L 3418-linkerV-V t 4715 . 

This construct was obtained by digesting plasmid pGOSA.B 
5 with Sall-EcoRI and inserting the fragment PCR-V 

Sall/EcoRI (Figure 24) containing the V^.4715 gene. The 
oligonucleotides used to obtain PCR-V (DBL . 3 and DBL.9, 
see Table l) were designed to match the nucleotide 
sequence of the framework- 1 and framework- 4 regions of 

10 the V L .4715 gene, respectively. DBL. 3 removed the Sad 
site from the framework- 1 region (silent mutation) and 
introduced a Sail restriction site upstream of the V^.4715 
gene. DBL.9 destroyed the Xhol restriction site in the 
framework-4 region of the V L .4715 gene (silent mutation) 

15 and introduced a NotI and an EcoRI restriction site 
downstream of the stop codon. 

pGOSA.E 

This plasmid contains a dicistronic operon comprising DNA 
20 encoding V H .4715 linked by the (Gly 4 Ser) ^AlaGlySerAla 

linker to V tt .3418 plus DNA encoding V L .3418 linked by the 
(Gly 4 Ser) 2 Gly 4 Val linker to V L .4715 (see Figure 21), thus: 
pelB-V B 4715-linkerA-V H 3418 + pelB-V Xi 3418-linkerV-V L 4715 . 

25 Both translational units are preceded by a ribosome 

binding site and DNA encoding a pelB leader sequence. 
This plasmid was obtained by a three -point ligation by 
mixing the vector resulting from pGOSA.D after removal of 
the V H 3418- encoding Pstl-SacI insert with the Pstl-Nhel 

30 pGOSA.C insert containing V H .4715 linked to V H .3418 and 
the PCR-III Nhel/SacI fragment (see Figure 25) . The 
remaining Pstl-SacI pGOSA.D vector contains the 5 1 end of 
the framework- 1 region of V H .3418 upto the PstI 
restriction site and V L .3418 linked by the 

35 (Gly 4 Ser) a Gly 4 Val linker to V L .4715 starting from the Sad 

restriction site in V L .3418. The Pstl-Whel pGOSA.C insert 
contains V H .4715 linked by the (Gly 4 Ser) 3 AlaGlySerAla 
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linker to V H .3418, starting from the Ps tl restriction site 
in the framework-1 region in V H .4715. The Nhel-SacI PCR- 
III fragment provides the ribosome binding site and DNA 
encoding the pelB leader sequence for the V L .34l8- 
5 (Gly 4 Ser) 2 Gly 4 Val-V L -4715 construct. The oligonucleotides 

DBL.10 and PCR-116 (see Table 1) used to generate PCR-III 
were designed to match the sequence upstream of the 
ribosome binding site of V L .4715 in Pv.4715 and to 
introduce an Nhel restriction site (DBL.10), and to match 
10 the framework-4 region of V L .3418 (PCR.116) . 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: UNILEVER PLC 

(B) STREET: Blackfriars 

(C) CITY: London 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP) : EC4P 4BQ 

(G) TELEPHONE: (01234) 222644 

(H) TELEFAX: (01234) 222633 

(I) TELEX: 82229 UN I LAB G 

(ii) TITLE OF INVENTION: Multivalent and mul ti specif ic antigen-binding 
protein 

(iii) NUMBER OF SEQUENCES: 27 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: 

Patentln Release #1.0, Version #1.25 (EPO) (SEC ID NO. 1 to 18) 
Patentln Release #1.0, Version #1.30 (EPO) (SEC ID NO. 19 to 27) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
AAGCTTGCAT GGAAATTCTA TTTCAAGGAG ACAGTCATAA TGAAATACCT ATTGCCTACG 6 0 



GCAGCCGCTG GATTGTTATT ACTCGCTGCC CAACCAGCGA TGGCCCAGGT GCAGCTGCAG 
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GAGTCAGGGG GAGACTTAGT GAAGCCTGGA GGGTCCCTGA CACTCTCCTG TGCAACCTCT 180 

GGATTCACTT TCAGTAGTTA TGCCTTTTCT TGGGTCCGCC AGACCTCAGA CAAGAGTCTG 240 

GAGTGGGTCG CAACCATCAG TAGTACTGAT ACTTATACCT ATTATTCAGA CAATGTGAAG 3 00 

GGGCGCTTCA CCATCTCCAG AGACAATGGC AAGAACACCC TGTACCTGCA AATGAGCAGT 360 

CTGAAGTCTG AGGACACAGC CGTGTATTAC TGTGCAAGAC ATGGGTACTA TGGTAAAGGC 420 

TATTTTGACT ACTGGGGCCA AGGGACCACG GTCACCGTCT CCTCAGGTGG AGGCGGTTCA 480 

GGCGGAGGTG GCTCTGGCGG TGGCGGATCG GCCGGTTCGG CCCAGGTCCA GCTGCAACAG 540 

TCAGGACCTG AGCTGGTAAA GCCTGGGGCT TCAGTGAAGA TGTCCTGCAA GGCTTCTGGA 600 

TACACATTCA CTAGCTATGT TATGCACTGG GTGAAACAGA AGC CTGGGCA GGGCCTTGAG 660 

TGGATTGGAT ATATTTATCC TTACAATGAT GGTACTAAGT ACAATGAGAA GTTCAAAGGC 720 

AAGGCCACAC TGACTTCAGA CAAATCCTCC AGCACAGCCT ACATGGAGCT CAGCAGCCTG 780 

ACCTCTGAGG ACTCTGCGGT CTATTACTGT TCAAGACGCT TTGACTACTG GGGCCAAGGG 840 

ACCACCGTCA CCGTCTCCTC ATAATAAGCT AGCGGAGCTG CATGCAAATT CTATTTCAAG 900 

GAGACAGTCA TAATGAAATA CCTATTGCCT ACGGCAGCCG CTGGATTGTT ATTACTCGCT 960 

GCCCAACCAG CGATGGCCGA CATCGAGCTC ACCCAGTCTC CATCTTCCAT GTATGCATCT 1020 

CTAGGAGAGA GAATCACTAT CACTTGCAAG GCGAGTCAGG ACATTAATAC CTATTTAACC 1080 

TGGTTCCAGC AGAAACCAGG GAAATCTCCC AAGACCCTGA TCTATCGTGC AAACAGATTG 114 0 

CTAGATGGGG TCCCATCAAG GTTCAGTGGC AGTGGATCTG GGCAAGATTA TTCTCTCACC 1200 

ATCAGCAGCC TGGACTATGA AGATATGGGA ATTTATTATT GTCTACAATA TGATGAGTTG 1260 

TACACGTTCG GAGGGGGGAC CAAGCTCGAG ATCAAACGGG GTGGAGGCGG TTCAGGCGGA 1320 

GGTGGCTCTG GCGGTGGCGG AGTCGACATC GAACTCACTC AGTCTCCATT CTCCCTGACT 1380 

GTGACAGCAG GAGAGAAGGT CACTATGAAT TGCAAGTCCG GTCAGAGTCT GTTAAACAGT 144 0 
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GTAAATCAGA GGAACTACTT GACCTGGTAC CAGCAGAAGC CAGGGCAGCC TCCTAAACTG 1500 

TTGATCTACT GGGCATCCAC TAGGGAATCT GGAGTCCCTG ATCGCTTCAC AGCCAGTGGA 1560 

TCTGGAACAG ATTTCACTCT CACCATCAGC AGTGTGCAGG CTGAAGACCT GGCAGTTTAT 1620 

TACTGTCAGA ATGATTATAC TTATCCGTTC ACGTTCGGAG GGGGGACCAA GCTCGAAATC 1680 

AAACGGGGAT CCGGTAGCGG GAACTCCGGT AAGGGGTACC TGAAGTAATA AGCGGCCGCG 174 0 

AATTC 1745 
(2) INFORMATION FOR SEQ ID NO: 2: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
AAGCTTGCAT GCAAATTCTA TTTCAAGGAG ACAGTCATAA TGAAATACCT ATTGCCTACG 60 

GCAGCCGCTG GATTGTTATT ACTCGCTGCC CAACCAGCGA TGGCCCAGGT GCAGCTGCAG 120 

GAGTCAGGAC CTGGCCTGGT GGCGCCCTCA CAGAGCCTGT CCATCACATG CACCGTCTCA 180 

GGGTTCTCAT TAACCGGCTA TGGTGTAAAC TGGGTTCGCC AGCCTCCAGG AAAGGGTCTG 240 

GAGTGGCTGG GAATGATTTG GGGTGATGGA AACACAGACT ATAATTCAGC TCTCAAATCC 3 00 

AGACTGAGCA TCAGCAAGGA CAACTCCAAG AGCCAAGTTT TCTTAAAAAT GAACAGTCTG 36 0 

CACACTGATG ACACAGCCAG GTACTACTGT GCCAGAGAGA GAGATTATAG GCTTGACTAC 420 

TGGGGCGAAG GCACCACGGT CACCGTCTCC TCAGGTGGAG GCGGTTCAGG CGGAGGTGGC 48 0 

TCTGGCGGTG GCGGATCGGA CATCGAGCTC ACCCAGTCTC CAGCCTCCCT TTCTGCGTCT 54 0 

GTGGGAGAAA CTGTCACCAT CACATGTCGA GCAAGTGGGA ATATTCACAA TTATTTAGCA 600 
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TGGTATCAGC AGAAACAGGG AAAATCTCCT CAGCTCCTGG TCTATTATAC AACAACCTTA 660 

GCAGATGGTG TGCCATCAAG GTTCAGTGGC AGTGGATCAG GAACACAATA TTCTCTCAAG 720 

ATCAACAGCC TGCAACCTGA AGATTTTGGG AGTTATTACT GTCAACATTT TTGGAGTACT 780 

CCTCGGACGT TCGGTGGAGG CACCAAGCTC GAGATCAAAC GGGAACAAAA ACTCATCTCA 840 

GAAGAGGATC TGAATTAATA AGATCAAACG GTAATAAGGA TCCAGCTCGA ATTC 894 
(2) INFORMATION FOR SBQ ID NO: 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

AAGCTTGCAT GCAAATTCTA TTTCAAGGAG ACAGTCATAA TGAAATACCT ATTGCCTACG 60 

GCAGCCGCTG GATTGTTATT ACTCGCTGCC CAACCGGCCA TGGCCCAGGT GCAGCTGCAG 12 0 

GAGTCAGGGG GAGACTTAGT GAAGCCTGGA GGGTCCCTGA CACTCTCCTG TGCAACCTCT 18 0 

GGATTCACTT TCAGTAGTTA TGCCTTTTCT TGGGTCCGCC AGACCTCAGA CAAGAGTCTG 240 

GAGTGGGTCG CAACCATCAG TAGTACTGAT ACTTATACCT ATTATTCAGA CAATGTGAAG 3 00 

GGGCGCTTCA CCATCTCCAG AGACAATGGC AAGAACACCC TGTACCTGCA AATGAGCAGT 360 

CTGAAGTCTG AGGACACAGC CGTGTATTAC TGTGCAAGAC ATGGGTACTA TGGTAAAGGC 420 

TATTTTGACT ACTGGGGCCA AGGGACCACG GTCACCGTCT CCTCAGGTGG AGGCGGTTCA 480 

GGCGGAGGTG GCTCTGGCGG TGGCGGATCG GACATCGAGC TCACTCAGTC TCCATTCTCC 540 

CTGACTGTGA CAGCAGGAGA GAAGGTCACT ATGAATTGCA AGTCCGGTCA GAGTCTGTTA 600 

AACAGTGTAA ATCAGAGGAA CTACTTGACC TGGTACCAGC AGAAGCCAGG GCAGCCTCCT 66 0 
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AAACTGTTGA TCTACTGGGC ATCCACTAGG GAATCTGGAG TCCCTGATCG CTTCACAGCC 720 

AGTGGATCTG GAACAGATTT CACTCTCACC ATCAGCAGTG TGCAGGCTGA AGACCTGGCA 780 

GTTTATTACT GTCAGAATGA TTATACTTAT CCGTTCACGT TCGGAGGGGG GACCAAGCTC 840 

GAGATCAAAC GGGGATCCGG TAGCGGGAAC TCCGGTAAGG GGTACCTGAA GTAATAAGAT 900 

CAAACGGTAA TAAGGATCCA GCTCGAATTC 93 0 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CCATGGGATG GAGCTGTATC ATCCTCTTCT TGGTAGCAAC AGCTACAGGT AAGGGGCTCA 60 
CAGTAGCAGG CTTGAGGTCT GGACATATAT ATGGGTGACA ATGACATCCA CTTTGCCTTT 120 
CTCTCCACAG GTGTCCACTC CCAGGTCCAA CTGCAG 15 6 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
AGGTSMAMCT GCAGSAGTCW GG 22 
(2) INFORMATION FOR SEQ ID NO: 6: 
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{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 
TGAGGAGACG GTGACCGTGG TCCCTTGGCC CC 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Bingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 
GACATTGAGC TCACCCAGTC TCCA 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 

GTTAGATCTC GAGCTTGGTC CC 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 45 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CA66ATCCGG CCGGTTCGGC CCAGGTCCAG CTGCAACAGT CAGGA 45 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CTACATGAAT TCGCTAGCTT ATTATGAGGA GACGGTGACG GTGGTCCCTT GGC 53 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ATTGGAGTCG ACATCGAACT CACTCAGTCT CCATTCTCC 39 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUBNCB CHARACTERISTICS : 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DMA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

CGAATTCGGA TCCCCGTTTG ATTTCGAGCT TGGTCC 

(2) INFORMATION POR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GAGCGCGAGC TCGGCCGAAC CGGCCGATCC GCCACCGCCA GAGCC 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
ATTGTCGAAT TCGTCGACTC CGCCACCGCC AGAGCC 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUBNCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



AGCTTCTAGA CCACCATGGA AAACTGCAGA GCTCAAAAGC TAGCGCGGCG GCTCTAG 



57 



(2) INFORMATION FOR SEQ ID NO: 16: 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE : nucleic acid 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) 



MOLECULE TYPE: DNA (genomic) 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



AATTCTAGAG CGGCCGCGCT AGCTTTTGAG CTCTGCAGTT TTCCATGGTG GTCTAGA 



57 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ACGGGTGAGC TCGATGTCGG AGTGGACACC TGTGGAGAGA 4 0 

(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



GGAAACAGCT ATGACCATGA TTAC 24 
(2) INFORMATION FOR SBQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: primer DBL.7 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 19: 



CACCATCTCC AGAGACAATG GCAAG 25 
(2) INFORMATION FOR SBQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: primer DBL.8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



ACCAAGCTCG AGATCAAACG GGG 23 
(2) INFORMATION FOR SBQ ID NO: 21: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 50 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /deec = "synthetic DNA" 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: primer DBL.9 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TGAAGTGAAT TCGCGGCCGC TTATTACCGT TTGATTTCGA GCTTGGTCCC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: primer DBL.10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
TAATAAGCTA GCGGAGCTGC ATGCAAATTC TATTTC 
(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 737 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION : /deec « "cDNA domains with synthetic linker(B) " 



(vii) IMMBDIATB SOURCE: 
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(B) CLONE: BcoRI-Hindlll insert of pUR4124 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 11. -730 

(D) OTHER INFORMATION : /product* H VLlys-GS-VHlys" 



(ix) FEATURE: 

(A) NAME /KEY : raat_peptide 

(B) LOCATION : 11 . . 334 

(D) OTHER INFORMATION: /product^ °VLlys° 



(ix) FEATURE: 

(A) NAME /KEY : miec_RNA 

(B) L0CATION:335. .379 

(D) OTHER INFORMATION : /product «= "(Gly4Ser)3 linker" 



(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 38 0. .727 

(D) OTHER INFORMATION : /product* "VHlys" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GAATTCGGCC GAC ATC GAG CTC ACC CAG TCT CCA GCC TCC CTT TCT GCG 49 
Asp lie Glu Leu Thr Gin Ser Pro Ala Ser Leu Ser Ala 
15 10 



TCT GTG GGA GAA ACT GTC ACC ATC ACA TGT CGA GCA AGT GGG AAT ATT 97 
Ser Val Gly Glu Thr Val Thr He Thr Cye Arg Ala Ser Gly Asn He 
15 20 25 

CAC AAT TAT TTA GCA TGG TAT CAG CAG AAA CAG GGA AAA TCT CCT CAG 145 
His Asn Tyr Leu Ala Trp Tyr Gin Gin Lys Gin Gly Lys Ser Pro Gin 
30 35 40 45 



CTC CTG GTC TAT TAT ACA ACA ACC TTA GCA GAT GGT GTG CCA TCA AGG 193 
Leu Leu Val Tyr Tyr Thr Thr Thr Leu Ala Asp Gly Val Pro Ser Arg 
50 55 60 



TTC AGT GGC AGT GGA TCA GGA ACA CAA TAT TCT CTC AAG ATC AAC AGC 
Phe Ser Gly Ser Gly Ser Gly Thr Gin Tyr Ser Leu Lys He Asn Ser 
65 70 75 



241 
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CTG CAA CCT GAA GAT TTT GGG AGT TAT TAC TGT CAA CAT TTT TGG AGT 289 
Leu Gin Pro Glu Asp Phe Gly Ser Tyr Tyr Cys Gin His Phe Trp Ser 
80 85 90 

ACT CCT CGG ACG TTC GGT GGA GGG ACC AAG CTC GAG ATC AAA CGG GGT 33 7 

Thr Pro Arg Thr Phe Gly Gly Gly Thr Lys Leu Glu lie Lye Arg Gly 
95 100 105 

GGA GGC GGT TCA GGC GGA GGT GGC TCT GGC GGT GGC GGA TCG CAG GTG 385 
Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gin Val 
HO 115 120 125 

CAG CTG CAG GAG TCA GGA CCT GGC CTG GTG GCG CCC TCA CAG AGC CTG 433 
Gin Leu Gin Glu Ser Gly Pro Gly Leu Val Ala Pro Ser Gin Ser Leu 
130 135 140 

TCC ATC ACA TGC ACC GTC TCA GGG TTC TCA TTA ACC GGC TAT GGT GTA 481 
Ser He Thr Cys Thr Val Ser Gly Phe Ser Leu Thr Gly Tyr Gly Val 
145 150 155 



AAC TGG GTT CGC CAG CCT CCA GGA AAG GGT CTG GAG TGG CTG GGA ATG 529 
Aen Trp Val Arg Gin Pro Pro Gly Lys Gly Leu Glu Trp Leu Gly Met 
160 165 170 

ATT TGG GGT GAT GGA AAC ACA GAC TAT AAT TCA GCT CTC AAA TCC AGA 577 
He Trp Gly Asp Gly Asn Thr Asp Tyr Asn Ser Ala Leu Lys Ser Arg 
175 180 185 

CTG AGC ATC AGC AAG GAC AAC TCC AAG AGC CAA GTT TTC TTA AAA ATG 625 
Leu Ser He Ser Lys Asp Asn Ser Lys Ser Gin Val Phe Leu Lye Met 
190 195 200 205 

AAC AGT CTG CAC ACT GAT GAC ACA GCC AGG TAC TAC TGT GCC AGA GAG 673 
Asn Ser Leu Hie Thr Asp Asp Thr Ala Arg Tyr Tyr Cys Ala Arg Glu 
210 215 220 

AGA GAT TAT AGG CTT GAC TAC TGG GGC CAA GGG ACC ACG GTC ACC GTC 721 
Arg Asp Tyr Arg Leu Asp Tyr Trp Gly Gin Gly Thr Thr Val Thr Val 
225 230 235 



TCC TCA TGA TAAGCTT 
Ser Ser * 
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240 

(2) INFORMATION FOR SBQ ID NO : 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 920 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc ° "cDNA domains with synthetic linker <e)" 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hindlll-EcoRI insert Fv.3418 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 36 . .443 

(D) OTHER INFORMATION : /products "pelB- VH3418 " 

<ix) FEATURE: 

<A) NAME/KEY: sig_peptide 
(B) LOCATION: 36. .101 

(D) OTHER INFORMATION: /product- "pectate lyase" 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 102. .44 0 

(D) OTHER INFORMATION: /products "VH3418" 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION:495. .864 

(D) OTHER INFORMATION: /product* *pelB-VL4318 " 

(ix) FEATURE: 

(A) NAME /KEY : sig_peptide 

(B) LOCATION: 4 95. .560 

(D) OTHER INFORMATION: /product <= "pectate lyase" 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 561. .881 
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(D) OTHER INFORMATION: /product* "VL3418" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

AA6CTTGCAA ATTCTATTTC AAGGAGACAG TCATA ATG AAA TAC CTA TTG CCT 53 

Met Lys Tyx Leu Leu Pro 

-22 -20 

ACG GCA GCC GCT GGA TTG TTA TTA CTC GCT GCC CAA CCA GCG ATG GCC 101 
Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro Ala Met Ala 
-15 -10 -5 

CAG GTG CAG CTG CAG CAG TCA GGA CCT GAG CTG GTA AAG CCT GGG GCT 14 9 

Gin Val Gin Leu Gin Gin Ser Gly Pro Glu Leu Val Lys Pro Gly Ala 
15 10 15 

TCA GTG AAG ATG TCC TGC AAG GCT TCT GGA TAC ACA TTC ACT AGC TAT 197 
Ser Val Lya Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Ser Tyr 
20 25 30 

GTT ATG CAC TGG GTG AAA CAG AAG CCT GGG CAG GGC CTT GAG TGG ATT 245 
Val Met His Trp Val Lys Gin Lys Pro Gly Gin Gly Leu Glu Trp lie 
35 40 45 

GGA TAT ATT TAT CCT TAC AAT GAT GGT ACT AAG TAC AAT GAG AAG TTC 293 
Gly Tyr lie Tyr Pro Tyr Asn Asp Gly Thr Lys Tyr Asn Glu Lys Phe 
50 55 60 

AAA GGC AAG GCC ACA CTG ACT TCA GAC AAA TCC TCC AGC ACA GCC TAC 341 
Lys Gly Lys Ala Thr Leu Thr Ser Asp Lys Ser Ser Ser Thr Ala Tyr 
65 70 75 80 

ATG GAG CTC AGC AGC CTG ACC TCT GAG GAC TCT GCG GTC TAT TAC TGT 38 9 

Met Glu Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr Cys 
85 90 95 

TCA AGA CGC TTT GAC TAC TGG GGC CAA GGG ACC ACG GTC ACC GTC TCC 437 
Ser Arg Arg Phe Asp Tyr Trp Gly Gin Gly Thr Thr Val Thr Val Ser 
100 105 no 

TCA TAA TAAGAGCTAT GGGAGCTTGC ATGCAAATTC TATTTCAAGG AGACAGTCAT 493 
Ser * 



A ATG AAA TAC CTA TTG CCT ACG GCA GCC GCT GGA TTG TTA TTA CTC 
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Met Lye Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu 
-22 -20 -15 -10 



GCT GCC CAA CCA GCG ATG GCC GAC ATC GAG CTC ACC CAG TCT CCA TCT 587 
Ala Ala Gin Pro Ala Met Ala Asp He Glu Leu Thr Gin Ser Pro Ser 
-5 15 



TCC ATG TAT GCA TCT CTA GGA GAG AGA ATC ACT ATC ACT TGC AAG GCG 635 
Ser Met Tyr Ala Ser Leu Gly Glu Arg He Thr He Thr Cys Lys Ala 
10 15 20 25 

AGT CAG GAC ATT AAT ACC TAT TTA ACC TGG TTC CAG CAG AAA CCA GGG 683 
Ser Gin Asp He Asn Thr Tyr Leu Thr Trp Phe Gin Gin Lye Pro Gly 
30 35 40 

AAA TCT CCC AAG ACC CTG ATC TAT CGT GCA AAC AGA TTG CTA GAT GGG 731 
Lys Ser Pro Lys Thr Leu He Tyr Arg Ala Asn Arg Leu Leu Asp Gly 
45 50 55 



GTC CCA TCA AGG TTC AGT GGC AGT GGA TCT GGG CAA GAT TAT TCT CTC 779 
Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Gin Asp Tyr Ser Leu 
60 65 70 

ACC ATC AGC AGC CTG GAC TAT GAA GAT ATG GGA ATT TAT TAT TGT CTA 827 
Thr He Ser Ser Leu Asp Tyr Glu Asp Met Gly He Tyr Tyr Cys Leu 
75 80 85 



CAA TAT GAT GAG TTG TAC ACG TTC GGA GGG GGG ACC AAG CTC GAG ATC 875 
Gin Tyr Asp Glu Leu Tyr Thr Phe Gly Gly Gly Thr Lys Leu Glu He 
90 95 100 105 

AAA CGG TAA TAATGATCAA ACGGTATAAG GATCCAGCTC GAATTC 920 
Lys Arg * 



(2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) 



MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /deoc = "cDNA domains with synthetic linker(s) w 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hindlll-BcoRI insert of Fv.4715-myc 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION :4 0 . ,468 

(D) OTHER INFORMATION: /products B pelB-VH47l5" 



(ix) FEATURE: 

(A) NAME/KEY : sig_peptide 
(B> LOCATION : 4 0 . .105 

(D) OTHER INFORMATION: /product = "pectate lyase" 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 106. .465 

(D) OTHER INFORMATION : /product = "VH4715" 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 520 . . 963 

(D) OTHER INFORMATION: /product- "pelB-VL4 715-myc" 

(ix) FEATURE: 

(A) NAME /KEY : sig_peptide 

(B) LOCATION : 520 . .585 

(D) OTHER INFORMATION: /products "pectate lyase" 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 586. .927 

(D) OTHER INFORMATION: /product = "VL4715" 

(ix) FEATURE: 

(A) NAME /KEY : miec_RNA 

(B) LOCATION: 928. .960 

(D) OTHER INFORMATION: /product « "myc-tag* 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



AAGCTTGCAT GCAAATTCTA TTTCAAGGAG ACAGTCATA ATG AAA TAC CTA TTG 

Met Lys Tyr Leu Leu 
-22 -20 
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CCT ACG GCA GCC GCT GGA TTG TTA TTA CTC GCT GCC CAA CCA GCG ATG 102 
Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro Ala Met 
-15 -10 -5 

GCC CAG GTG CAG CTG CAG GAG TCA GGG GGA GAC TTA GTG AAG CCT GGA 150 
Ala Gin Val Gin Leu Gin Glu Ser Gly Gly Aep Leu Val Lys Pro Gly 
15 10 15 

GGG TCC CTG ACA CTC TCC TGT GCA ACC TCT GGA TTC ACT TTC AGT AGT 198 
Gly Ser Leu Thr Leu Ser Cye Ala Thr Ser Gly Phe Thr Phe Ser Ser 
20 25 30 

TAT GCC TTT TCT TGG GTC CGC CAG ACC TCA GAC AAG AGT CTG GAG TGG 24 6 

Tyr Ala Phe Ser Trp Val Arg Gin Thr Ser Asp Lys Ser Leu Glu Trp 
35 40 45 

GTC GCA ACC ATC AGT AGT ACT GAT ACT TAT ACC TAT TAT TCA GAC AAT 2 94 

Val Ala Thr lie Ser Ser Thr Asp Thr Tyr Thr Tyr Tyr Ser Asp Aen 
50 55 60 

GTG AAG GGG CGC TTC ACC ATC TCC AGA GAC AAT GGC AAG AAC ACC CTG 342 
Val Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Gly Lys Asn Thr Leu 
65 70 75 

TAC CTG CAA ATG AGC AGT CTG AAG TCT GAG GAC ACA GCC GTG TAT TAC 3 90 

Tyr Leu Gin Met Ser Ser Leu Lys Ser Glu Asp Thr Ala Val Tyr Tyr 
80 85 90 95 

TGT GCA AGA CAT GGG TAC TAT GGT AAA GGC TAT TTT GAC TAC TGG GGC 438 
Cye Ala Arg His Gly Tyr Tyr Gly Lys Gly Tyr Phe Asp Tyr Trp Gly 
100 105 110 

CAA GGG ACC ACG GTC ACC GTC TCC TCA TAA TAAGAGCTAT GGGAGCTTGC 488 
Gin Gly Thr Thr Val Thr Val Ser Ser * 
115 120 

ATGCAAATTC TATTTCAAGG AGACAGTCAT A ATG AAA TAC CTA TTG CCT ACG 540 

Met Lys Tyr Leu Leu Pro Thr 
-22 -20 



GCA GCC GCT GGA TTG TTA TTA CTC GCT GCC CAA CCA GCG ATG GCC GAC 
Ala Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro Ala Met Ala Asp 
-15 -10 -5 1 
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ATC GAG CTC ACT CAG TCT CCA TTC TCC CTG ACT GTG ACA GCA GGA GAG 636 
lie Glu Leu Thr Gin Ser Pro Phe Ser Leu Thr Val Thr Ala Gly Glu 
5 10 15 



AAG GTC ACT ATG AAT TGC AAG TCC GGT CAG AGT CTG TTA AAC AGT GTA 684 
Lys Val Thr Met Asn Cys Lys Ser Gly Gin Ser Leu Leu Asn Ser Val 
20 25 30 



AAT CAG AGG AAC TAC TTG ACC TGG TAC CAG CAG AAG CCA GGG CAG CCT 732 
Asn Gin Arg Asn Tyr Leu Thr Trp Tyr Gin Gin Lys Pro Gly Gin Pro 
35 40 45 



CCT AAA CTG TTG ATC TAC TGG GCA TCC ACT AGG GAA TCT GGA GTC CCT 780 
Pro Lys Leu Leu lie Tyr Trp Ala Ser Thr Arg Glu Ser Gly Val Pro 
50 55 60 65 

GAT CGC TTC ACA GCC AGT GGA TCT GGA ACA GAT TTC ACT CTC ACC ATC 828 
Asp Arg Phe Thr Ala Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr lie 
70 75 80 

AGC AGT GTG CAG GCT GAA GAC CTG GCA GTT TAT TAC TGT CAG AAT GAT 876 
Ser Ser Val Gin Ala Glu Asp Leu Ala Val Tyr Tyr Cys Gin Asn Asp 
85 90 95 

TAT ACT TAT CCG TTC ACG TTC GGA GGG GGG ACC AAG CTC GAG ATC AAA 924 
Tyr Thr Tyr Pro Phe Thr Phe Gly Gly Gly Thr Lys Leu Glu He Lys 
100 105 HO 

CGG GAA CAA AAA CTC ATC TCA GAA GAG GAT CTG AAT TAA TAAGATCAAA 973 
Arg Glu Gin Lys Leu He Ser Glu Glu Asp Leu Asn * 
115 120 125 



CGGTAATAAG GATCCAGCTC GAATTC 

(2) INFORMATION FOR SBQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 924 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULB TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "cDNA domains with synthetic linker (s) « 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hindlll-EcoRI insert of scPv. 4715-myc 

(ix) FEATURE: 

(A) NAME /KEY : sig_peptide 

(B) LOCATION: 40. .105 

(D) OTHER INFORMATION: /product- "pectate lyase" 

(ix) FEATURE : 

(A) NAME/KEY: raat_peptide 

(B) LOCATION: 106. .465 

(D) OTHER INFORMATION : /product- "VH4715" 

(ix) FEATURE: 

(A) NAME /KEY: misc_RNA 

(B) LOCATION : 4 66 . .510 

(D) OTHER INFORMATION : /product- " (Gly4Ser) 3 -linker" 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 511. .852 

(D) OTHER INFORMATION: /product- "VL4715- 

(ix) FEATURE: 

<A) NAME /KEY : raiec_RNA 
(B) LOCATION: 8 53 . .8 85 

(D) OTHER INFORMATION -./product- "myc-tag" 

( ix) FEATURE : 

(A) NAME/KEY: CDS 
- (B) LOCATION: 4 0. .888 

<D) OTHER INFORMATION : /product - 

«pelB-VH4715- (Gly4Ser) 3-VL4715-myc» 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

AAGCTTGCAT GCAAATTCTA TTTCAAGGAG ACAGTCATA ATG AAA TAC CTA TTG 54 

Met Lys Tyr Leu Leu 
-22 -20 



CCT ACG GCA GCC GCT GGA TTG TTA TTA CTC GCT GCC CAA CCA GCG ATG 
Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro Ala Met 

-15 -10 -5 



102 



WO 97/38102 



PCT/EP97/01609 



- 51 - 

GCC CAG GTG CAG CTG CAG GAG TCA GGG GGA GAC TTA GTG AAG CCT GGA 150 
Ala Gin Val Gin Leu Gin Glu Ser Gly Gly Asp Leu Val Lye Pro Gly 
1 5 10 15 

GGG TCC CTG ACA CTC TCC TGT GCA ACC TCT GGA TTC ACT TTC AGT AGT 198 
Gly Ser Leu Thr Leu Ser Cys Ala Thr Ser Gly Phe Thr Phe Ser Ser 
20 25 30 

TAT GCC TTT TCT TGG GTC CGC CAG ACC TCA GAC AAG AGT CTG GAG TGG 246 
Tyr Ala Phe Ser Trp Val Arg Gin Thr Ser Asp Lye Ser Leu Glu Trp 
35 40 45 

GTC GCA ACC ATC AGT AGT ACT GAT ACT TAT ACC TAT TAT TCA GAC AAT 294 
Val Ala Thr He Ser Ser Thr Asp Thr Tyr Thr Tyr Tyr Ser Asp Aen 
50 55 60 

GTG AAG GGG CGC TTC ACC ATC TCC AGA GAC AAT GGC AAG AAC ACC CTG 342 
Val Lye Gly Arg Phe Thr He Ser Arg Asp Asn Gly Lye Aen Thr Leu 
65 70 75 

TAC CTG CAA ATG AGC AGT CTG AAG TCT GAG GAC ACA GCC GTG TAT TAC 3 90 

Tyr Leu Gin Met Ser Ser Leu Lys Ser Glu Asp Thr Ala Val Tyr Tyr 
80 85 90 95 

TGT GCA AGA CAT GGG TAC TAT GGT AAA GGC TAT TTT GAC TAC TGG GGC 438 
Cye Ala Arg His Gly Tyr Tyr Gly Lys Gly Tyr Phe Asp Tyr Trp Gly 
100 105 no 

CAA GGG ACC ACG GTC ACC GTC TCC TCA GGT GGA GGC GGT TCA GGC GGA 486 
Gin Gly Thr Thr Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly 
115 120 125 

GGT GGC TCT GGC GGT GGC GGA TCG GAC ATC GAG CTC ACT CAG TCT CCA 534 
Gly Gly Ser Gly Gly Gly Gly Ser Asp He Glu Leu Thr Gin Ser Pro 
130 135 140 

TTC TCC CTG ACT GTG ACA GCA GGA GAG AAG GTC ACT ATG AAT TGC AAG 582 
Phe Ser Leu Thr Val Thr Ala Gly Glu Lys Val Thr Met Asn Cys Lys 
145 ISO 155 

TCC GGT CAG AGT CTG TTA AAC AGT GTA AAT CAG AGG AAC TAC TTG ACC 630 
Ser Gly Gin Ser Leu Leu Aen Ser Val Asn Gin Arg Asn Tyr Leu Thr 
160 165 170 175 
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TGG TAC CAG CAG AAG CCA GGG CAG CCT CCT AAA CTG TTG ATC TAC TGG 
Trp Tyr Gin Gin Lye Pro Gly Gin Pro Pro Lys Leu Leu lie Tyr Trp 
180 185 190 



€78 



GCA TCC ACT AGG GAA TCT GGA GTC CCT GAT CGC TTC ACA GCC AGT GGA 726 
Ala Ser Thr Arg Glu Ser Gly Val Pro Aep Arg Phe Thr Ala Ser Gly 
195 200 205 

TCT GGA ACA GAT TTC ACT CTC ACC ATC AGC AGT GTG CAG GCT GAA GAC 774 
Ser Gly Thr Asp Phe Thr Leu Thr lie Ser Ser Val Gin Ala Glu Asp 
210 215 220 



CTG GCA GTT TAT TAC TGT CAG AAT GAT TAT ACT TAT CCG TTC ACG TTC 822 
Leu Ala Val Tyr Tyr Cye Gin Aan Asp Tyr Thr Tyr Pro Phe Thr Phe 
225 230 235 

GGA GGG GGG ACC AAG CTC GAG ATC AAA CGG GAA CAA AAA CTC ATC TCA 870 
Gly Gly Gly Thr Lys Leu Glu lie Lye Arg Glu Gin Lye Leu lie Ser 
240 245 250 255 



GAA GAG GAT CTG AAT TAA TAAGATCAAA CGGTAATAAG GATCCAGCTC GAATTC 924 
Glu Glu Asp Leu Asn * 
260 

(2) INFORMATION FOR SBQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1706 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc « "cDNA domains with synthetic linker (s) 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hindlll-EcoRI insert of pGOSA.B 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 40. .864 

(D) OTHER INFORMATION : /product « w pelB-VH4715-LiA-VH3418" 



(ix) FEATURE: 
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(A) NAME /KEY: eig_peptide 

(B) LOCATION:40. .105 

(D) OTHER INFORMATION : /product* "pectate lyase" 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION : 1 0 6 . .465 

(D) OTHER INFORMATION : /product* n VH4715" / 

(ix) FEATURE: 

(A) NAME/KEY: raisc_RNA 

(B) LOCATION: 46 6. .522 

<D) OTHER INFORMATION: /product- "linkerA 
(Gly4Ser) 3AlaGlySerAla B 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION : 523 . .861 

(D) OTHER INFORMATION : /product = "VH3418" 

<ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 913 . .1689 

(D) OTHER INFORMATION: /product = w pelB-VL34 18-LiV-VL4715 H 

(ix) FEATURE: 

(A) NAME /KEY : sig_peptide 

(B) LOCATION: 913 . . 978 

<D) OTHER INFORMATION: /product" "pectate lyase** 

(ix) FEATURE: 

(A) NAME/KEY: matjpeptide 

(B) LOCATION: 979. .1299 

(D) OTHER INFORMATION : /products -VL3418" 

(ix) FEATURE: 

(A) NAME /KEY : misc_RNA 

(B) LOCATION: 1300. .1344 

(D) OTHER INFORMATION: /product, "linker V 
(Gly4Ser) 2Gly4Val" 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 1345. .1686 
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<D) OTHER INFORMATION : /product = "VL4715* 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

AAGCTTGCAT GGAAATTCTA TTTCAAGGAG ACAGTCATA ATG AAA TAC CTA TTG 54 

Met Lye Tyr Leu Leu 
-22 -20 

CCT ACG GCA GCC GCT GGA TTG TTA TTA CTC GCT GCC CAA CCA GCG ATG 102 
Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro Ala Met 
-15 -10 -5 



GCC CAG GTG CAG CTG CAG GAG TCA GGG GGA GAC TTA GTG AAG CCT GGA 150 
Ala Gin Val Gin Leu Gin Glu Ser Gly Gly Asp Leu Val Lye Pro Gly 
15 10 15 



GGG TCC CTG ACA CTC TCC TGT GCA ACC TCT GGA TTC ACT TTC AGT AGT 198 
Gly Ser Leu Thr Leu Ser Cys Ala Thr Ser Gly Phe Thr Phe Ser Ser 
20 25 30 

TAT GCC TTT TCT TGG GTC CGC CAG ACC TCA GAC AAG AGT CTG GAG TGG 246 
Tyr Ala Phe Ser Trp Val Arg Gin Thr Ser Asp Lys Ser Leu Glu Trp 
35 40 45 

GTC GCA ACC ATC AGT AGT ACT GAT ACT TAT ACC TAT TAT TCA GAC AAT 294 
Val Ala Thr He Ser Ser Thr Asp Thr Tyr Thr Tyr Tyr Ser Asp Asn 
50 55 60 

GTG AAG GGG CGC TTC ACC ATC TCC AGA GAC AAT GGC AAG AAC ACC CTG 342 
Val Lys Gly Arg Phe Thr He Ser Arg Asp Asn Gly Lye Asn Thr Leu 
65 70 75 

TAC CTG CAA ATG AGC AGT CTG AAG TCT GAG GAC ACA GCC GTG TAT TAC 3 90 

Tyr Leu Gin Met Ser Ser Leu Lys Ser Glu Asp Thr Ala Val Tyr Tyr 
60 85 90 95 

TGT GCA AGA CAT GGG TAC TAT GGT AAA GGC TAT TTT GAC TAC TGG GGC 438 
Cys Ala Arg His Gly Tyr Tyr Gly Lys Gly Tyr Phe Asp Tyr Trp Gly 
100 105 110 



CAA GGG ACC ACG GTC ACC GTC TCC TCA GGT GGA GGC GGT TCA GGC GGA 
Gin Gly Thr Thr Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly 
115 120 125 
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WO 97/38102 



PCT7EP97/01609 



- 55 - 

GGT GGC TCT GGC GGT GGC GGA TCG GCC GGT TCG GCC CAG GTC CAG CTG 534 
Gly Gly Ser Gly Gly Gly Gly Ser Ala Gly Ser Ala Gin Val Gin Leu 
130 135 140 

CAA CAG TCA GGA CCT GAG CTG GTA AAG CCT GGG GCT TCA GTG AAG ATG 582 
Gin Gin Ser Gly Pro Glu Leu Val Lys Pro Gly Ala Ser Val Lys Met 
145 150 155 

TCC TGC AAG GCT TCT GGA TAC ACA TTC ACT AGC TAT GTT ATG CAC TGG 630 
Ser Cys Lye Ala Ser Gly Tyr Thr Phe Thr Ser Tyr Val Met His Trp 
160 165 170 175 

GTG AAA CAG AAG CCT GGG CAG GGC CTT GAG TGG ATT GGA TAT ATT TAT 678 
Val Lys Gin Lys Pro Gly Gin Gly Leu Glu Trp He Gly Tyr He Tyr 
180 185 190 

CCT TAC AAT GAT GGT ACT AAG TAC AAT GAG AAG TTC AAA GGC AAG GCC 726 
Pro Tyr Asn Asp Gly Thr Lys Tyr Asn Glu Lys Phe Lys Gly Lys Ala 
195 200 205 

ACA CTG ACT TCA GAC AAA TCC TCC AGC ACA GCC TAC ATG GAG CTC AGC 774 
Thr Leu Thr Ser Asp Lys Ser Ser Ser Thr Ala Tyr Met Glu Leu Ser 
210 215 220 

AGC CTG ACC TCT GAG GAC TCT GCG GTC TAT TAC TGT TCA AGA CGC TTT 822 
Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr Cys Ser Arg Arg Phe 
225 230 235 

GAC TAC TGG GGC CAA GGG ACC ACC GTC ACC GTC TCC TCA TAA B 64 
Asp Tyr Trp Gly Gin Gly Thr Thr Val Thr Val Ser Ser * 
240 ~ 245 250 

TAAGCTAGCG GAGCTGCATG CAAATTCTAT TTCAAGGAGA CAGTCATA ATG AAA TAC 921 

Met Lys Tyr 

-22 -20 

CTA TTG CCT ACG GCA GCC GCT GGA TTG TTA TTA CTC GCT GCC CAA CCA 969 
Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro 
-15 -10 -5 



GCG ATG GCC GAC ATC GAG CTC ACC CAG TCT CCA TCT TCC ATG TAT GCA 
Ala Met Ala Asp He Glu Leu Thr Gin Ser Pro Ser Ser Met Tyr Ala 
15 10 
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TCT CTA GGA GAG AGA ATC ACT ATC ACT TGC AAG GCG AGT CAG GAC ATT 
Ser Leu Gly Glu Arg He Thr He Thr Cye Lys Ala Ser Gin Asp He 
15 20 25 



1065 



AAT ACC TAT TTA ACC TGG TTC CAG CAG AAA CCA GGG AAA TCT CCC AAG 1113 
Asn Thr Tyr Leu Thr Trp Phe Gin Gin Lys Pro Gly Lys Ser Pro Lys 
30 35 40 45 



ACC CTG ATC TAT CGT GCA AAC AGA TTG CTA GAT GGG GTC CCA TCA AGG 1161 
Thr Leu He Tyr Arg Ala Aen Arg Leu Leu Asp Gly Val Pro Ser Arg 
50 55 60 



TTC AGT GGC AGT GGA TCT GGG CAA GAT TAT TCT CTC ACC ATC AGC AGC 1209 
Phe Ser Gly Ser Gly Ser Gly Gin Asp Tyr Ser Leu Thr He Ser Ser 
65 70 75 

CTG GAC TAT GAA GAT ATG GGA ATT TAT TAT TGT CTA CAA TAT GAT GAG 1257 
Leu Aep Tyr Glu Asp Met Gly He Tyr Tyr Cys Leu Gin Tyr Asp Glu 
80 85 90 

TTG TAC ACG TTC GGA GGG GGG ACC AAG CTC GAG ATC AAA CGG GGT GGA 1305 
Leu Tyr Thr Phe Gly Gly Gly Thr Lys Leu Glu He Lys Arg Gly Gly 
95 100 105 

GGC GGT TCA GGC GGA GGT GGC TCT GGC GGT GGC GGA GTC GAC ATC GAA 1353 
Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Val Aep He Glu 
110 115 120 125 



CTC ACT CAG TCT CCA TTC TCC CTG ACT GTG ACA GCA GGA GAG AAG GTC 1401 
Leu Thr Gin Ser Pro Phe Ser Leu Thr Val Thr Ala Gly Glu Lye Val 
130 135 140 



ACT ATG AAT TGC AAG TCC GGT CAG AGT CTG TTA AAC AGT GTA AAT CAG 144 9 

Thr Met Asn Cye Lys Ser Gly Gin Ser Leu Leu Asn Ser Val Asn Gin 
145 150 155 

AGG AAC TAC TTG ACC TGG TAC CAG CAG AAG CCA GGG CAG CCT CCT AAA 14 97 

Arg Asn Tyr Leu Thr Trp Tyr Gin Gin Lys Pro Gly Gin Pro Pro Lys 
160 165 170 

CTG TTG ATC TAC TGG GCA TCC ACT AGG GAA TCT GGA GTC CCT GAT CGC 1545 

Leu Leu He Tyr Trp Ala Ser Thr Arg Glu Ser Gly Val Pro Asp Arg 
175 180 185 
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TTC ACA GCC AGT GGA TCT GGA ACA GAT 
Phe Thr Ala Ser Gly Ser Gly Thr Asp 
190 195 

GTG CAG GCT GAA GAC CTG GCA GTT TAT 
Val Gin Ala Glu Asp Leu Ala Val Tyr 
210 

TAT CCG TTC ACG TTC GGA GGG GGG ACC 
Tyr Pro Phe Thr Phe Gly Gly Gly Thr 
225 230 
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TTC ACT CTC ACC ATC AGC AGT 1593 
Phe Thr Leu Thr lie Ser Ser 
200 205 

TAC TGT CAG AAT GAT TAT ACT 1641 
Tyr Cys Gin Aen Asp Tyr Thr 
215 220 

AAG CTC GAA ATC AAA CGG TAA 1689 
Lys Leu Glu lie Lys Arg * 
235 



TAAGCGGCCG CGAATTC 



1706 
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CLAIMS 

1. A multivalent antigen binding protein comprising: 

a first polypeptide comprising, in series, 
three or more variable domains of an antibody 
heavy chain; and 

a second polypeptide comprising, in series, 
three or more variable domains of an antibody 
light chain, 

said first and second polypeptides being linked by 
association of the respective heavy chain and light 
chain variable domains, each associated variable 
domain pair forming an antigen binding site. 

2. A protein according to Claim 1 comprising a 
trivalent antigen binding protein. 

3. A protein according to Claim 1 or Claim 2 wherein 
the variable domains of the antibody heavy chain of 
said first polypeptide are linked by a peptide 
linker and the variable domains of the antibody 
light chain of said second polypeptide are linked by 
a peptide linker. 

4. A protein according to any one of Claims 1 to 3 
wherein the associated variable domain pair binding 
sites are able to bind different epitopes from each 
other. 

5. A protein according to any one of Claims 1 to 3 
wherein the associated variable domain pair binding 
sites are able to bind the same epitope as each 
other . 

6. Nucleotide sequences coding for the polypeptides of 
the multivalent antigen binding protein of any one 
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of the preceding claims. 

7. Nucleotide sequences according to Claim 6 contained 
in one or more expression vectors. 

8 . A host cell transformed with a vector according to 
Claim 7, and capable of expression of the nucleotide 
sequences to produce the polypeptides of the 
multivalent antigen binding protein. 

9 . A host cell according to Claim 8 wherein the 
polypeptides on expression associate to form the 
multivalent antigen binding protein. 

10. A process for preparing a multivalent antigen 
binding protein according to any one of Claims 1 to 
5 comprising 

(i) transforming one or more hosts by 
incorporating genes encoding said first and 
second polypeptides; 

(ii) expressing said genes and said host or 
hosts; and 

(iii) allowing said first and second 
polypeptides to associate to form the protein. 

11. A protein according to any one of Claims 1 to 5 for 
use in medicine. 

12. A diagnostic or therapeutic composition comprising a 
protein according to any one of Claims 1 to 5. 

13 . Use of composition according to Claim 12 in the 
preparation of an agent for use in diagnosis or 
therapy. 



14. 



A method of diagnosis or therapy comprising 
administering a protein according to any one of 
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Claims 1 to 5 . 

15 . Use of a protein according to any one of Claims 1 to 
5 in an immunoassay method or for purification. 
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Fig.lB. 



VH1 



VL1 



VL2 



VH2 



VH3 



VL3 




SUBSTITUTE SHEET (RULE 26) 



WO 97/38102 



PCIYEP97/01609 



pelB 



V55 



Fig.2A. M K Y L L P T 

AAGCTTGCATGCAAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACG 

AAAGLLLLAAQPAMA {^jMl^^^jP} 

leader gcagccgctggattgttattactcgctgcccaaccagcgatggcccaggtgca gctgtag 



GAGTCAGGGGGAGACTTAGTGAAGCCTGGAGGGTCCCTGACACTCTCCTGTGCAACCTCT 



1 



GGATTCACTTTCAGTAGTTATGCCTTTTCTTGGGTCC^ 



VH4715 



GAGTGGGTCGCAACCATCAGTAGTACTGATACTTATACCTATTATTCAGA 



GGGCGCTTCACCATCTCCAGAGACAATGGCAAGAACACCCTCT 



CTGAAGTCTGAGGACACAGCCGTGTATTACTGTGCAAGACATGGGTACTATGGTAAAGGC 



G G G G S I 



TATTTTGACTACTGGGGCCAAGGGACCACfiQICACCGTCTCCTCAGGTGGAGGCGGTTCA 



GGCGGAGGTGGCTCTGGCGGTGGCGGATCGQCCfiGTICffiaCC 



TCAGGACCTGAGCTGGTAAAGCC^ 



TACACATTCACTAGCTATGTTATGCACTGGGTGAAA^ 



VH3418 



TGGATTGGATATATTTATCCTTACAATGATGGTACTAAGTACAATGAGAAGTTCAAAGGC 



AAGGCCACACTGACTTCAGACAAATCCTCCAGCACAGCCTACATGGAGCTCAGCAGCCTG 



ACCTCTGAGGACTCTGCGGTCTATTACTGTTCAAGACGCTTTGACTA 



^ACCA^CG^^ 

nelB MKYLLPTAAAGLLLLA 

leader gagacagtcataatgaaatacctattgcctacggcagccgctggattgttattactcgct 



A Q P A M A 
GCCCAACCAGCGATGGCCGACATCGAGCTCATC^ 



VL3418 



CTAGG AG AG AG AATC ACTATC ACTTGC AAGGCGAGTC AGGACATT AATACCTATTTAACC 



TGGTTCCAGCAGAAACCAGGGAAATCTCCCAAGACCCTGATCTATCGTGCAAACAGATTG 
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Fig.2B. 



CTAGATGGGGTCCCATCAAGGTTCAGTGGCAGTGGATCTGGGCAAGATTATTCTCTCACC 



\TCAGCAGCCTGGACTATGAAGATATGGGAATTTATTATTGTCTACAATA 



Linker 



G G G G S G G I 
TACACGTTCGGAGGGGGGACCAAGCTCGAGATCAAACGGGGTGGAGGCGGTTCAGGCGG 



| G G S G G G G V| 

GGTGGCTCTGGCGGTGGCGGAGTCGACATCGAACTCAC^AGTCT 



GTGACAGCAGGAGAGAAGGTCACTATGAATTGCAAGTCCGGTCAGAGTCT 



GTAAATCAGAGGAACTACTTGACCTGGT^CCAGCAGAAGCCAGGGCAG^ 



VL4715 @8jS 



TTGATCTACTGGGCATCCACTAGGGAATCTGGAGTCCCTGATCGCTTC 



TCTGGAACAGATTTCACTCTCACCATCAGCAGTGTGCAGGCTGAAGACCTG 



TACTGTCAGAATGATTATACTTATCCGTTCACGTTCGGA 

H y^-" ( QSGSGUSGKG y"~ETD 

9 AAACGGGGATCCGGTAGCGGGAACTCCGGTAAGGGGTACCTGAAGTAATAA GCGGCrnr n 



AATTC 
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Fig.3A. 



M K Y L L P T 
AAGCTTGCATGCAAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACG 



pelB AAAGLLLLAAQPAMA fc'flRSfe 

leader gcagccgctggattgttattactcgctgcccaaccagcgatggcccaottgcSS 



gagtcaggacctggcctggtggcgccctcacagagcctgtccatcacatgcaccgtc 



VHlys 



gggttctcattaaccggctatggtgtaaac 



gagtggctgggaatgatttggggtgatggaaacaca^ctataattcagc 



AGACTGAGCATCAGCAAGGACAACrCCAAGAGCCAAGTTTTCT^ 



'MM 



CACACTGATGACACAGCCAGGTACTACTGTGCCAGAGAGAGAGAT^ 



GGGGSGGGG l 

TGGGGCGAAGGCACCACGGTCACCGTCTCf*T^ 




VLlys 



TGGTATCAGCAGAAACAGGGAAAATCTCCTCAGCrrCCTGGTCT 



GCAGATGGTGTGCCATCAAGGTTCAGTGGCAGTGG^ 



ATCAACAGCCTGCAACCTGAAGATT^^ 



g Q K L I S> 



Myc-tag 



CCTCGGACGTTCGGTGGAGGCACCAAGOffi^ATCAAACGGGAACAAAAACTCATCTCA 



Ce E D L lp 

GAAGAGGATCTGAATTAATAAGATCAAACGGTAATAAGGATCCAGCTCGAATTC 
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Fig.3B. 



M K Y L L P T 
AAGCTTGCATGCAAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATT 

pelB AAAGLLLLAAQPAMA ^^^^S^M 

leader gcagccgctggattgttattactcgctgcccaaccagcgatgg^ 




GAGTCAGGGGGAGACTTAGTGAAGCCTGGAGGGTCCCTGACACTCTCC^ 



GGATTCACTTTCAGTAGTTATGCCl-rri^ 



GAGTGGGTCGCAACCATCAGTAGTA 



VH4715 




CTGAAGTCTGAGGACACAGCCGTGTATTACTGTGCAAGACAT^ 



G G G G S 



TATTTTGACTACTGGGGCCAAGGGAC^ 

Linker | g g g g s g~ g g~g sT 



GGCGGAGGTGGCTCTGGCGGTGGCGGATCGGACATC GAC^ 



lCTCAGTCTCCATTCTCC 



CTGACTGTGACAGCAGGAGAGAAGGTCACTiSTGAATn 

mmmm 




VL4715 



AAACTGTTGATCTACTGGGCATCCACTAGGGAATCTC6AGTCCCTGM?C^^^^^^^CC 



mm 



AGTGGATCTGGAACAGMTTCACTCTCACCATCAGCAGTGTC ^^ 




Hydro2-tag 



MM< S sgsgnsgkgylk") 



G^ATCAAACGGGGATCCGGTAGCGGGAACTCCGGTAAGGGGTACCTGAAGTAATAAGAT 
CAAACGGTAATAAGGATCCAGCTCGAATTC 
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Fig.4. 
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EXON 2 
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Fig.6A. 
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Fig.7A. 
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Fig.7B. 
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Fig.8A. 



Xbal Pstl/Nhel/Xbal 



Ncol PstI 
leader sequence 




AmpR 



Insert Ncol /PstI leader 
fragment into Euka.pUC 



polvlinker 
Hincffll EcoR] 



Euka-oolvlinker 

Hindffl/Xbal/Ncol/Pstl/Sacl/ 
Nhel/Notl/Xbal/EcoRI 




Insert synthetic 
"Euka" polylinker 
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Euka.1/Euka.2 



AmpR 




AmpR 



Ncol Sad 
leader sequence 



Insert NcoI/SacI leader 
fragment into Euka.pUC 
(PGR fragment ML1/ML2) 



Xbal Sacl/Notl/Xbal 




AmpR 
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Fig.8B. 



Xbal Pstl/Nhel/Xba! 




VH-A linker VH-B 



Insert VH2 or VH3 Pstl/Nhel 
fragments from pUC constructs into 
Pstl/Nhel opened vector 




VH-A linker VH-B linker VH-C 




AmpR 



Excise Xbal /Xbal fragments 
from Euka constructs and insert into 
Xbal opened pSV.51 vector 



pSV.B/pSV.D/pSV.G/pSV.K/pSV.M 



SUBSTITUTE SHEET (RULE 26) 



WO 97/38102 



PCIYEP97/D1609 



^5 



Xbal Sacl/Notl/Xbal 




Fig.8C. 



Sad 



Amp R 



Insert VL2-HII or VL3-HII 
EcoRI-Klenow / SacI fragments 
from pUC constructs into 
Notl-Klenow / SacI vector 
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VL-A linker VL-B 



SacI 
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AmpR 
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from Euka constructs and insert into 
Xbal opened pSV.51 vector 
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Fig.11. 
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Fig. 12. 



GAA7TCnGCCGACATC£AGC2^ACCCAGTCTCCAGCCTCCCTTTCT 



VLLys 



AACTGTCACCATCACATGTCGAGCAAGTGGGAATATTCACAA 



GCAGAAACAGGGAAAATCTCCTCAGCTCCTGGTCTATTATACAACAACCTTAG^ 



TGTGCCATCAAGGTTCAGTGGCAGTGGATCAGGAACACAATATTCTCTCAAGATCAACAG 



CCTGCAACCTGAAGATTTTGGGAGTTATTACTGTCAACATTTTTC^ 



Linker 



h^gfek^ Iggggsggggs] 

GTTCGGTGGAACCAAGCTCG^ATCAAACGGGGTGGAGGCGGTTCAGGCGGAGGTGGCTC 



G G G G S 



TGGCGGTGGCGGATCGCAGGTGCAGC32GQ5i^AGTCAGGACCTGGCCTGGTGGCGCCCTC 



ACAGAGCCTGTCCATCACATGCACCGTCTCAGGGTTCTCATTAACCGGCTATGGTCT 



CTGGGTTCGCCAGCCTCCAGGAAAGGGTCTGGAGTGGC^ 



VHLyS AAACACAGACTATAATTCAGCTCTCAAATCCAGACTGAGCATCAGCAAGGA 



GAGCCAAGTTTTCTTAAAAATGAACAGTCTGCACACTGATGACAC^ 



TGCCAGAGAGAGAGATTATAGGCTTGACTACTGGGGCCAAGGGACCACGGTCACCGTCTC 



CTCATGATMfiCTZ! 
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la. 13. 

MKYLLPTA 
pelB AAGCTTGCAAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTTC 

leader 

AAGLLLLAAQPAMA 



CCGCTGGATTGTTATTACTCGCTGCCCAACCAGC^^ 



VH3418 



-AGGACCTGAGCTGGTAAAGCCTGGGGCTTCAGTGAA 



ACACATTCACTAGCTATGTTATGCACTGGGTGAAACAGAAGCCTGGGCAGG 



GGATTGGATATATTTATCCTTACAATGATGGTACTAAGTACAATGAGAAGTO 



AGGCCACACTGACTTCAGACAAATCCTCCAGCACAGCCTACATGGAGCTCAGCAGCC 



CCTCTGAGGACTCTGCGGTCTATTACTGTTCAAGACGCTTTGACT 
CCACGQTCfl^ 

MKYLL PTAAAGLLLL 
AGGAGACAGTCATAATGAAATACCTATTGCCTACGGCAGCCGCTGGATTGTTATTACTCG 

leader a a q p a m a 




CTCTAGGAGAGAGAATCACTATCACTTGCAAGGCGAGTC 



CCTGCTTCCAGCAGA^ 



VL3418 




TGTACACGTTCGGAGGGGGGACCAAGCK3J^tf 
ATAAGGATCCAGCTCGAATTC 
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M K Y L L P T 

PSlB AAGCTTGCATGCAAATTCTATTTCAAGGAGACAGT^^ 

leader 

AAAGLLLLAAQPAMA lQIiKM^l|Qg-| 
GCAGCCGCTGGATTGTTATTACTCGCTGC^ 



GAGT(^GGGGGAGACITAGTGAAGCCTGGAGGGTCCC^ 



GGATTCACTTTCAGTAGTTATCCCTTTTCTTGGGTC 



VH4715 



GAGTGGGTCGCAACCATCAGTAGTACTGATACTTATACCTATTATTCAGACAATG 



— 



GGGCGCTTCACCATCTCCAGAGACAATGGCAAGAACACCCTGTACCTGCAAATGAGCAGT 



CTGAAGTCTGAGGACACAGCCGTGTATTACTGTGCAAGACATGGGTACTATGGTAAAGGC 



TATTTTGACTACTGGGGCCAAGGGACCAC GGTCACC ^^ 



M K Y L L P T 
GAGCTTGCATGCAAATTCTATTTCAAGGAGACAGTCATAATC 

leader . . . _ . . . 

L A A Q 



P A M A 



GCAGCCGCTGGATTGTTATTACTCGCTGCC^ 



A A A G L L L 



CAGTCTtTCATTCTCCCTGACTGTGACAGCAGGAG 



GGTCAGAGTCTGTTAAACAGTGTAAATCAGAGGAACTACTTG^ 



CCAGGGCAGCCTCCTAAACTGTTGATCTACTGGGCATCCACTAGGGAATCTGGAGTCCCT 



VL4715 



Silts 



GATCGCTTCACAGCCAGTGGATCTGGAACAGATTTCACTCTC 



GCTGAAGACCTGGCAGTTTATTACTGTCAGAATGATTATAC^ 



Myc-tag 



Cg Q K L I 



S E E D L tT> 



GGGGGGACCAAG£IS^£fiATCAAACGGGAACAAAAACTCATCTX!^GAAGAGGATCTGAA^ 



TAATAAGATCAAACGGTAATAAGGATCCAGCTCGAATTC 
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pelB 



M K Y L L P T 



leader AAGCOTGCATGCAAATOCTATTO 



AAAG LLLLAAQPAMA 



GCAGCCGCTGGATTGTTATTACTCGCTGCCCAACCAGCGATGGCCCAGGTGCAG£^^£AQ 



jAGTCAGGGGGAGACTTAGTGAAGCCTGGAGGGTCCCTGACACTCTC 



GGATTCACTTTCAGTAGTTATGCCTTTTCTTGGGTCCGCCA 



VH4715 GAGT6QGTCGC ^ 



GGGCGCTTCACCATCTCCAGAGACAATGGCAAGAACACCCTGTACCTGC 



CTGAAGTCTGAGGACACAGCCGTGTATTACTGTGCAAGACATGGCT 



Linker 



TATTTT< 



'GACTACTGGGGCCAAGGGACCACS2ICACCGTCTCCTCAGGTGGAGGCGGTTCA 



jGGGGSGGGGS 
GGCGGAGGTGGCTCTGGCGGTGGCGGATCGGACATC GAGC^ 



CTGACTGTGACAGCAGGAGAGAAGGTCACTATGAATTC^ 



VL4715 



AACAGTGTAAATCAGAGGAACTACTTGACCTGGTACCAGCAGAAGCCAGGG^ 



AAACTGT 



TGATCTACTGGGCATCXACTAGGGAATCTGGAGTCCCT 



AGTGGATCTGGAACAGATTTCACTCTCACCMCAGC 



GTTTATTACTGTCAGAATGATTATACTTATCCGTTCAi 



CGTTCGGAGGGGGGACCAAGCTC 



Myc-tag 



C EQKLISEEDLN^) 
£&GATCAAACGGGAACAAAAACTCATCTCAGAAGAGGATCTG 

GTAATAAGGATCCAGCTCGAATTC 



SUBSTITUTE SHEET (RULE 26) 



WO 97/38102 



pelB 
leader 



VH4715 



%5 



PCT/EP97/01609 



Fig.16A. 



M K Y L L P T 
AAGCTTGCATGCAAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTG 

AAAGLLLLAAQPAMA 
GCAGCCGCTGGATTGTTATTACTCGCTGCCCAACC^^ 



GAGTCAGGGGGAGACTTAGTGAAGCCTGGAGGGTCCCTGACACTCTCCTGTGCAA(XTCT 



GGATTCACTTTCAGTAGTTATGCCTTTTCT 



GAGTGGGTCGCAACCATCAGTAGTACTGATACTTATACCTATTATTCAGAC^ 



GGGCGCTTCACCATCTCCAGAGACAATGGCAAGAACACCCTGTACCTG^ 

■ 

CTGAAGTCTGAGGACACAGCCGTGTATTACTCTGCAAGACATGGGTAOT 



u 

TATTTTGACTACTGGGGCCAAGGGACCAC GGTCACC GTCTC 



G G G G 



Linker 



GGGGSGGGGSAGSAj 



GGCGGAGGTGGCTCTC&CGGTGGCGGATCGGCra^ 




TACACATTCACTAGCTATGTTATGCACTGGGTGAAACAGAAGCCTGGGCAG 



VH3418 TGGATOGGATATA ' OTA ' E ^^ 



ACCACCGTCACCGTCTCCTCAT^^ 



pelB 



MKYLLPTAA 



G L L L L A 



leader gagaca gtcataatgaaatacctattgcctacggcagccgctggattgt^ 




VL34 1 8 CTAGGAGAGAGAATCACTATCAC ^ GCAA 



TGGTTCCAGCAGAAACCAGGGAAATCTCCCAAGACCCTGATCTATCGTGCA^^^TTC 
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CTAGATGGGCTCCCATCAAGGTTCACTXX3CACT 



ATCAGCAGCCTGGACTATGAAGATATGGGAATTTATTATTGTCTACAATATC 



Linker 



l lGGGGSGG} 
TACACGTTCGGAGGGGGGACCAAGCTCGAGATCAAACGGGGTGGAGGC^ 



| G G S G G G G V| f 

GGTGGCTCTGGCGGTGGCGGAGTCGACATCGAACTCACTC 



mimmmmmmsmmmm 



GTGACAGCAGGAGAGAAGGTCACTATGAATTGCAAGTCCGGTCAGAG 



GTAAATCAGAGGAACTACTTGACCTGGTACCAGCAGAAGCCAGGGCAGCCTCCT 



VL4715 



TTGATCTACTGGGCATCCACTAGGGAATCTGGAGTCCCTGATCGCTTCACAGC^ 



TCTGGAACAGATTTCACTCTCACCATCAGCAGTGTGCAGGC^ 



TACTGTCAGAATGATTATACTTATCCGTTCACGTTCGGAGGGGGGAC 



AAACGGTAATAA GCGGCCGC GAATTC 



SUBSTITUTE SHEET (RULE 26) 



WO 97/38102 PCT/EP97/01609 



2 &5 




SUBSTITUTE SHEET (RULE 26) 



WO 97/18102 



PCT/EP97/01609 



Fig. 17. 



Hindm EcpRl 
PsU BstEUSacl Xhol 




AmpR 



remove 
BstED/SacI 
fragment 



Hindi EcpRl 
PstI BstED SacI Xhol 




Amp R 



Sfil 

BstEI ISacJ 
PCR-I BstED /Sac] 



ligate BstEI /SacI 
(Gly 4 Ser) 3 linker 
into vector 
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AmpR 
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clone Xhol /EcoRI 
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^5 



SEQ-L& 
NO. 



5 PCR.51 

6 PCR.89 

7 PCR. 90 

8 PCR. 116 

9 DBL. 1 

10 DBL. 2 

11 DBL. 3 

12 DBL. 4 

13 DBL. 5 

14 DBL. 6 

15 Euka.l : 

16 Buka.2 : 

17 ML.l 

18 ML. 2 

19 DBL. 7 

20 DBL. 8 

21 DBL. 9 

22 DBL. 10 : 



5 f 
5' 
5' 
5» 



AGG T(C/G) (A/C) A(C/A)C TCC AG(C/G) AGT C{A/T)G G 
3' 

TGA GGA GAC GGT GAC CGT GGT CCC TTC GCC CC 3 ' 
GAC ATT GAG CTC ACC CAG TCT CCA 3' 
GTT AGA TCT CGA GCT TGG TCC C 3' 

CAG GAT CCG GCC GGT TOG GCC CAG GTC CAG CTC CAA 
CAG TCA GGA '3 

CTA CAT GAA TTC GCT AGC TTA TTA TGA GGA GAC GGT 
GAC GGT GGT CCC TTG GC f 3 

ATT GGA GTC GAC ATC GAA CTC ACT CAG TCT CCA TTC 
TCC 3' 

CGA ATT CGG ATC CCC GTT TGA TTT CGA GCT TGG TCC '3 
GAG CGC GAG CTC GGC CGA ACC GGC CGA TCC GCC ACC 
GCC AGA GCC f 3 

AAT GTC GAA TTC GTC GAC TCC GCC ACC GCC AGA GCC '3 
AGC TTC TAG ACC ACC ATC GAA AAC TCC AGA GCT CAA 
AAG CTA GCG CGG CGG CTC TAG '3 

AAT TCT AGA GCG GCC GCG CTA GCT TTT GAG CTC TCC 
AGT TTT CCA TGG TGG TCT AGA '3 

ACG GGT GAG CTC GAT GTC GGA GTG GAC ACC TCT GGA 
GAG A f 3 

GGA AAC AGC TAT GAC CAT GAT TAC '3 
CAC CAT CTC CAG AGA CAA TGG CAA G 3' 
ACC AAG CTC GAG ATC AAA CGG GG 3 1 
5' TGA AGT GAA TTC GOG GCC GCT TAT TAC CGT TTG ATT 

TOG AGC TTC GTC CC 3 » 
5' TAA TAA GCT AGC GGA GCT GCA TGG AAA TTC TAT TTC 3 1 

Table 1 



5' 

5' 

5" 

5* 



5' 
5» 
5» 
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Code Expressed Antibody fragment 



pSVB 


VH4715-VH3418 


pSVD 


VH4715-VHIys 


pSVG 


VH3418-VHIys 


pSVK 


VH4715-VHIys-VH3418 


pSVM : 


VHIys-VH4715-VH3418 


pSVN : 


VL3418-VL4715.2t 


pSVP : 


VUys-VL4715.2t 


pSVS : 


VLIys-VL3418.2t 


pSVV : 


VUys-VL471 5-VL341 8.2t 


pSVW : 


VL3418-VUys-VL4715.2t 
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